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(57) Abstract 

The present invention relates to sub- 
stantially pure polypeptides, which has a 
sequence identity of at least 80 % to an 
amino acid sequence disclosed, or which 
is a subsequence of at least 6 amino acids 
thereof, preferably a B- or T-cell epitope 
of the polypeptides disclosed. The polypep- 
tide or the subsequence thereof has at least 
one of nine properties. The use of the 
disclosed polypeptides in medicine is dis- 
closed, preferably as vaccine or diagnostic 
agents relating to virulent Mycobacterium. 
The invention further relates to the nu- 
cleotide sequences disclosed and the nu- 
cleotide sequences encoding the disclosed 
polypeptides. Medical and non-medical use 
of the nucleotide sequences is disclosed. 
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TB vaccine and diagnostic based on antigens from the M. tuberculosis cell 
BACKGROUND OF THE INVENTION 

Human tuberculosis (TB) caused by Mycobacterium tuberculosis is a serious global 
health problem responsible for approximately 3 million deaths annually, according to 
5 WHO. The world-wide incidence of new tuberculosis cases has been progressively falling 
for the last decade but the recent years have markedly changed this trend due to the 
advent of AIDS and the appearance of multidrug resistant strains of Mycobacterium 
tuberculosis. 

10 The only vaccine presently available for clinical use is BCG, a vaccine whose efficacy 
remains a matter of controversy. BCG generally induces a high level of acquired 
resistance in animal models of tuberculosis, but several human trials in developing 
countries have failed to demonstrate significant protection. Notably, BCG is not approved 
by the FDA for use in the United States because BCG vaccination impairs the specificity 

1 5 of the Tuberculin skin test for diagnosis of TB infection. 

This makes the development of a new and improved vaccine against tuberculosis an 
urgent matter which has been given a very high priority by the WHO. Many attempts have 
been made to define the protective Mycobacterial substances and a series of 

20 experiments were conducted to compare the protective efficacy of vaccination with live 
versus killed preparations of M. tuberculosis (Orme IM. Infect.lmmun.1988; 56:3310-12). 
The conclusion of these studies was that vaccination of mice with dead M. tuberculosis 
administered without adjuvants only induced short term protection against TB, whereas 
live M. tuberculosis vaccines induced efficient immunological memory. This information 

25 was the background for the further search for protective substances focused on antigens 
actively secreted from the live Mycobacteria (Andersen P. Infect.lmmun.1994; 62:2536- 
44, Horwitz et al. Proc. Natl Acad. Sci. USA 1995; 92:1530-4, Pal PG et al. Infect.lmmun. 
1992;60:4781-92). 

30 DETAILED DISCLOSURE OF THE INVENTION 

The present inventors conducted a study comparing the long term protection against TB 
after vaccination three times with killed M.tuberculosis administered with DDA as an 
adjuvant with the long term protection obtained with ST-CF, and surprisingly similar levels 
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of long term protection induced in the group receiving killed bacteria were found as in the 
group vaccinated with ST-CF/DDA (figure 1). 

This leads to the conclusion that protective components can be found also among the 
5 components of the cell wall, cell membrane or cytosol derived from a preparation of dead 
virulent Mycobacteria, 

It is thus an object of the present invention to provide a composition for the generation or 
determination of an immune response against a virulent Mycobacterium such as a 
10 vaccine for immunising a mammal, including a human being, against disease caused by 
a virulent Mycobacterium and a diagnostic reagent for the diagnosis of an infection with a 
virulent Mycobacterium. 

By the terms "somatic protein" or "protein derived from the cell wall, the cell membrane or 
15 the cytosol", or by the abbreviation "SPE" is understood a polypeptide or a protein extract 
obtainable from a cell or a part. A preferred method to obtain a somatic protein is 
described in the examples, especially examples 2, 3, 4, and 5. 

By the term "virulent Mycobacterium" is understood a bacterium capable of causing the 
20 tuberculosis disease in a mammal including a human being. Examples of virulent 
Mycobacteria are M. tuberculosis, M. africanum, and M. bovis. 

By "a TB patient" is understood an individual with culture or microscopically proven 
infection with virulent Mycobacteria, and/or an individual clinically diagnosed with TB and 
25 who is responsive to anti-TB chemotherapy. Culture, microscopy and clinical diagnosis of 
TB is well known by the person skilled in the art. 

A significant decrease or increase is defined as a decrease or increase which Is 
significant at the 95% level by comparison of immunised and placebo-treated groups 
30 using an appropriate statistical analysis such as a Student's two-tailed T test. 

By the term "PPD positive individual" is understood an individual with a positive Mantoux 
test or an individual where PPD induces an increase in in vitro recall response 
determined by release of IFN-y of at least 1,000 pg/ml from Peripheral Blood 
35 Mononuclear Cells (PBMC) or whole blood, the induction being performed by the addition 
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of 2.5 to 5 PPD/ml to a suspension comprising about 1 .0 to 2.5 x 10 s PBMC, the 
release of IFN-y being assessable by determination of IFN-y in supernatant harvested 5 
days after the addition of PPD to the suspension compared to the release of IFN-y 
without the addition of PPD. 

By the term "delayed type hypersensitivity reaction" is understood a T-cell mediated 
inflammatory response elicited after the injection of a polypeptide into or application to 
the skin, said inflammatory response appearing 72-96 hours after the polypeptide 
injection or application. 

By the term "IFN-y" is understood interferon-gamma. . 



Throughout this specification, unless the context requires otherwise, the word "comprise", 
or variations thereof such as "comprises" or "comprising", will be understood to imply the 
1 5 inclusion of a stated element or integer or group of elements or integers but not the 
exclusion of any other element or integer or group of elements or integers. 

By the term "a polypeptide" in the present application is generally understood a 
polypeptide of the invention, as will be described later. It is also within the meaning of "a 

20 polypeptide" that several polypeptides can be used, i.e. in the present context "a" means 
"at least one" unless explicitly indicated otherwise. The "polypeptide" is used to referrer to 
short peptides with a length of at least two amino acid residues and at most 10 amino 
acid residues, oligopeptides (11-100 amino acid residues), and longer peptides (the usual 
interpretation of "polypeptide", i.e. more than 100 amino acid residues in length) as well 

25 as proteins (the functional entity comprising at least one peptide, oligopeptide, or 

polypeptide which may be chemically modified by being phosphorylated. glycosylated, by 
being lipidated, or by comprising prosthetic groups). The definition of polypeptides 
comprises native forms of peptides/proteins in Mycobacteria as well as recombinant 
proteins or peptides in any type of expression vectors transforming any kind of host, and 

30 also chemically synthesised polypeptides. Within the scope of the invention is a 

. polypeptide which is at least 6 amino acids long, preferably 7. such as 8. 9, 10. 11,12, 
13, 14 amino acids long, preferably at least 15 amino acids, such as 15, 16, 17, 18, 19. 
20 amino acids long. However, also longer polypeptides having a length of e.g. 25, 50, 
75. 100, 125, 150, 175 or 200 amino acids are within the scope of the present invention. 

35 
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In the present context the term "purified polypeptide" means a polypeptide preparation 
which contains at most 5% by weight of other polypeptide material with which it is natively 
associated (lower percentages of other polypeptide material are preferred, e.g. at most 
4%, at most 3%, at most 2%, at most 1%, and at most 14%). It is preferred that the 
5 substantially pure polypeptide is at least 96% pure, i.e. that the polypeptide constitutes at 
least 96% by weight of total polypeptide material present in the preparation, and higher 
percentages are preferred, such as at least 97%, at least 98%, at least 99%, at least 
99,25%, at least 99,5%, and at least 99,75%. It is especially preferred that the 
polypeptide is in "essentially pure form", i.e. that the polypeptide is essentially free of any 

10 other antigen with which it is natively associated, i.e. free of any other antigen from 
bacteria belonging to the tuberculosis complex. This can be accomplished by preparing 
the polypeptide by means of recombinant methods in a non-mycobacterial host cell as 
will be described in detail below, or by synthesising the polypeptide by the well-known 
methods of solid or liquid phase peptide synthesis, e.g. by the method described by 

1 5 Merrifield or variations thereof. 

By the term "non-naturally occurring polypeptide" is understood a polypeptide that does 
not occur naturally. This means that the polypeptide is substantially pure, and/or that the 
polypeptide has been synthesised in the laboratory, and/or that the polypeptide has been 
20 produced by means of recombinant technology. 

By the terms "analogue" and "subsequence" when used in connection with polypeptides 
is meant any polypeptide having the same immunological characteristics as the 
polypeptides of the invention described above with respect to the ability to confer 
25 increased resistance to infection with virulent Mycobacteria. Thus, included is also a 
polypeptide from a different source, such as from another bacterium or even from a 
eukaryotic cell. 

The term "sequence identity" indicates a quantitative measure of the degree of homology 
30 between two amino acid sequences of equal length or between two nucleotide 

sequences of equal length. If the two sequences to be compared are not of equal length, 
they must be aligned to best possible fit. The sequence identity can be calculated as 

^ Nnr ^ w ° , wherein N dif is the total number of non-identical residues in the two sequences 

when aligned and wherein N re f is the number of residues in one of the sequences. Hence, 
35 the DNA sequence AGTCAGTC will have a sequence identity of 75% with the sequence 
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AATCAATC (N di ,=2 and N ref =8). A gap is counted as non-identity of the specific 
residue(s), i.e. the DNA sequence AGTGTC will have a sequence identity of 75% with the 
DNA sequence AGTCAGTC (N di ,=2 and N, e p8). Sequence identity can alternatively be 
calculated by the BLAST program e.g. the BLASTP program or the BLASTN program 
5 (Pearson W.R and D.J. Lipman (1988) PNAS USA 85:2444- 

2448)(www.ncbi.nlm.nih.gov/BLAST). In one aspect of the invention, alignment is 
performed with the global align algorithm with default parameters as described by X. 
Huang and W. Miller. Adv. Appl. Math. (1991) 12:337-357, available at 
http://www.ch.embnet.org/software/LALIGN_form.html. 

10 

When the term nucleotide is used in the following, it should be understood in the broadest 
sense. That is, most often the nucleotide should be considered as DNA. However, when 
DNA can be substituted with RNA, the term nucleotide should be read to include RNA 
embodiments which will be apparent for the person skilled in the art. For the purposes of 
15 hybridisation, PNA or LNA may be used instead of DNA. PNA has been shown to exhibit 
a very dynamic hybridisation profile and is described in Nielsen P E et al., 1991 , Science 
254: 1497-1500). LNA (Locked Nucleic Acids) is a recently introduced oligonucleotide 
analogue containing bicyclo nucleoside monomers (Koshkin et al., 1998, 54, 3607- 
3630;Nielsen, N.K. et al. J.Am.Chem.Soc 1998, 120, 5458-5463). 

20 

It is surprisingly demonstrated herein that the SPE comprising polypeptides isolated from 
the cell wall, cell membrane and cytosol induces protective immunity against infection 
with M.tuberculosis in an animal model, when injected with an adjuvant. It is 
contemplated that these polypeptides, either alone or in combination, can be used as 
25 vaccine components. 

It is further demonstrated that several polypeptides isolated from the cell wall, cell 
membrane or cytosol are recognised by human tuberculosis antisera. Therefore it is 
considered likely that these polypeptides, either alone or in combination, can be useful as 
30 diagnostic reagents in the diagnosis of tuberculosis. 

One embodiment of the invention relates to a method for producing a polypeptide in an 
immunological composition comprising the steps of: 
a) killing a sample of virulent Mycobacteria; 
35 b) centrifugating the sample of a); 
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c) resuspending the pellet of b) in PBS; 

d) centrifugating the suspension of c); 

e) extracting soluble proteins from the cytosol as well as cell wall and cell membrane 
from the supernatant of d) with SDS; 

5 f) centrifugating the extract of e); 

g) precipitating the supernatant of f) in cold acetone; 

h) resuspending the precipitate of g) in PBS; 

i) applying the resuspension of h) to 2 dimensional electrophoresis; 
j) blotting the gel of i) to a PVDF membrane; 

10 k) subjecting the spots on j) to N-terminal sequencing; 

I) searching a database for homology with the sequence of k) to identify the nucleotide 
sequence; 

m) cloning the nucleotide sequence of I) into an expression system; 
n) isolating and purifying the polypeptide expressed in m); and 
15 o) formulating the polypeptide of n) with an adjuvant substance in an immunological 
composition. 

Another embodiment is a method of producing a polypeptide originating from the cell wall 
in an immunological composition, said method comprising the steps of: 
20 a) killing a sample of virulent Mycobacteria] 

b) centrifugating the sample of a) 

c) resuspending the pellet of b) in PBS supplemented with EDTA and 
phenylmethylsulfonyl fluoride and sonicating for 15 min 

d) lysing the suspension of c) 

25 e) centrifugating the lysed suspension of d) 

f) resuspending the pellet of e) in homogenising buffer 

g) incubating the suspension of 0 with RNase and DNase overnight 

h) incubating the suspension of g) with SDS 

i) centrifugating the incubated suspension of h) 
30 j) incubating the supernatant of i) with SDS 

k) precipitating the incubated supernatant of j) with acetone 
I) resuspending the precipitate of k) in PBS 
m) subjecting the suspension of I) to a Triton X-1 14 extraction 
n) applying the resuspension of m) to 2 dimensional electrophoresis; 
35 o) blotting the gel of n) to a PVDF membrane; 



WO 00/21983 



7 



PCT/DK99/00538 



p) subjecting the spots on o) to N-terminal sequencing; 

q) searching a database for homology with the sequence of p) to identify the nucleotide 
sequence; 

r) cloning the nucleotide sequence of q) into an expression system; 
5 s) isolating and purifying the polypeptide expressed in r); and 

t) formulating the polypeptide of s) with an adjuvant substance in an immunological 
composition. 

A third embodiment is a method of producing a polypeptide originating from the cell 
10 membrane in an immunological composition, said method comprising the steps of: 

a) killing a sample of virulent Mycobacteria] 

b) centrifugating the sample of a) 

c) resuspending the pellet of b) in PBS supplemented with EDTA and 
phenylmethylsulfonyl fluoride and sonicating for 15 min 

15 d) lysing the suspension of c) 

e) centrifugating the lysed suspension of d) 

f) ultracentrifugating the supernatant of e) 

g) resuspending the pellet of f) in PBS 

h) subject the suspension of g) to a Triton X-1 14 extraction 

20 i) applying the resuspension of h) to 2 dimensional electrophoresis; 
j) blotting the gel of i) to a PVDF membrane; 
k) subjecting the spots on j) to N-terminal sequencing; 

I) searching a database for homology with the sequence of k) to identify the nucleotide 
sequence; 

25 m) cloning the nucleotide sequence of I) into an expression system; and 
n) isolating and purifying the polypeptide expressed in m); 

o) formulating the polypeptide of n) with an adjuvant substance in an immunological 
composition. 

30 A fourth embodiment is a method of producing a polypeptide originating from the cytosol 
in an immunological composition comprising the steps of: 

a) killing a sample of virulent Mycobacteria] 

b) centrifugating the sample of a) 

c) resuspending the pellet of b) in PBS supplemented with EDTA and 
35 phenylmethylsulfonyl fluoride and sonicating for 15 min 
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d) lysing the suspension of c) 

e) centrifugating the lysed suspension of d) 

f) ultracentrifugating the supernatant of e) 

g) precipitating the supernatant of f) with acetone 
5 h) resuspending the precipitate of g) in PBS 

i) applying the resuspension of h) to 2 dimensional electrophoresis; 

j) plotting the gel of i) to a PVDF membrane; 

k) subjecting the spots on j) to N-terminal sequencing; 

I) searching a database for homology with the sequence of k) to identify the nucleotide 
10 sequence; 

m) cloning the nucleotide sequence of I) into an expression system; 
n) isolating and purifying the polypeptide expressed in m); and 

0) formulating the polypeptide of n) with an adjuvant substance in an immunological 
composition. 

15 

In particular, the invention relates to a polypeptide obtainable by a method as described 
above which polypeptide has at least one of the following properties: 

1) it induces an in vitro recall response determined by a release of IFN-y of at least 1,500 
20 pg/ml from reactivated memory T-lymphocytes withdrawn from a C57BI/6J mouse within 

4 days after the mouse has been rechallenged with 1 x 10 6 virulent Mycobacteria, the 
induction being performed by the addition of the polypeptide to a suspension comprising 
about 2 x 10 5 cells isolated from the spleen of said mouse, the addition of the polypeptide 
resulting in a concentration of the polypeptide of not more than 20 jug per ml suspension, 
25 the release of IFN-y being assessable by determination of IFN-y in supernatant harvested 
3 days after the addition of the polypeptide to the suspension, 

ii) it induces an in vitro response during primary infection with virulent Mycobacteria, 
determined by release of IFN-y of at least 1 ,500 pg/ml from T-lymphocytes withdrawn 

30 from a mouse within 28 days after the mouse has been infected with 5 x 10 4 virulent 
Mycobacteria, the induction being performed by the addition of the polypeptide to a 
suspension comprising about 2 x 10 5 cells isolated from the spleen, the addition of the 
polypeptide resulting in a concentration of not more than 20 ^ig per ml suspension, the 
release of IFN-y being assessable by determination of IFN-y in supernatant harvested 3 

35 days after the addition of the polypeptide to the suspension, 
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iii) it induces a protective immunity determined by vaccinating an animal model with the 
polypeptide and an adjuvant in a total of three times with two weeks interval starting at 6- 
8 weeks of age, 6 weeks after the last vaccination challenging with 5 x 10 6 virulent 
5 Mycobacterials by aerosol and determining a significant decrease in the number of 
bacteria recoverable from the lung 6 weeks after the animal has been challenged, 
compared to the number recovered from the same organ in a mammal given placebo 
treatment, 

10 iv) it induces in vitro recall response determined by release of IFN-y of at least 1 ,000 
pg/ml from Peripheral Blood Mononuclear Cells (PBMC) or whole blood withdrawn from 
TB patients 0-6 months after diagnosis, or PPD positive individual, the induction being 
performed by the addition of the polypeptide to a suspension comprising about 1.0 to 2.5 
x 10 5 PBMC or whole blood cells, the addition of the polypeptide resulting in a 

15 concentration of not more than 20 fig per ml suspension, the release of IFN-y being 
assessable by determination of IFN-y in supernatant harvested 5 days after the addition 
of the polypeptide to the suspension, 

v) it induces a specific antibody response in a TB patient as determined by an ELISA 
20 technique or a western blot when the whole blood is diluted 1 :20 in PBS and stimulated 

with the polypeptide in a concentration of at the most 20 ^ig/ml and induces an OD of at 
least 0.1 in ELISA, or a visual response in western blot. 

vi) it induces a positive in vitro response determined by release of IFN-y of at least 500 
25 pg/ml from Peripheral Blood Mononuclear Cells (PBMC) withdrawn from an individual 

who is clinically or subclinical^ infected with a virulent Mycobacterium, the induction 
being performed by the addition of the polypeptide to a suspension comprising about 1 .0 
to 2.5 x 10 5 PBMC, the addition of the polypeptide resulting in a concentration of not 
more than 20 fig per ml suspension, the release of IFN-y being assessable by determi- 
30 nation of IFN-y in supernatant harvested 5 days after the addition of the polypeptide to 
the suspension, and preferably does not induce such an IFN-y release in an individual not 
infected with a virulent Mycobacterium, 

vii) it induces a positive in vitro response determined by release of IFN-y of at least 500 
35 pg/ml from Peripheral Blood Mononuclear Cells (PBMC) withdrawn from an individual 
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clinically or subclinical^ infected with a virulent Mycobacterium, the induction being 
performed by the addition of the polypeptide to a suspension comprising about 1.0 to 2.5 
x 10 5 PBMC, the addition of the polypeptide resulting in a concentration of not more than 
20 ng per ml suspension, the release of IFN-y being assessable by determination of IFN-y 
5 in supernatant harvested 5 days after the addition of the polypeptide to the suspension, 
and preferably does not induce such an IFN-y release in an individual not infected with a 
virulent Mycobacterium, 

viii) it induces a positive DTH response determined by intradermal injection or local 

10 application patch of at most 100 fig of the polypeptide to an individual who is clinically or 
subclinical^ infected with a virulent Mycobacterium, a positive response having a 
diameter of at least 10 mm 72-96 hours after the injection or application, 

ix) it induces a positive DTH response determined by intradermal injection or local 

15 application patch of at most 100 ^g of the polypeptide to an individual who is clinically or 
subclinical^ infected with a virulent Mycobacterium, a positive response having a 
diameter of at least 10 mm 72-96 hours after the injection, and preferably does not 
induce a such response in an individual who has a cleared infection with a virulent 
Mycobacterium. , 



20 



Any polypeptide fulfilling one or more of the above properties and which is obtainable 
from either the cell wall, cell membrane or the cytosol is within the scope of the present 
invention. 



25 The property described in i) will also be satisfied if the release of IFN-y from reactivated 
memory T-lymphocytes is 2,000 pg/ml, such as 3,000 pg/ml. In an alternative 
embodiment of the invention, the immunological effect of the polypeptide could be 
determined by comparing the IFN-y release as described with the IFN-y release from a 
similar assay, wherein the polypeptide is not added, a significant increase being 

30 indicative of an immunologically effective polypeptide. In a preferred embodiment of the 
invention, the addition of the polypeptide results in a concentration of not more than 20 
jig per ml suspension, such as 15 jig, 10 pg, 5 ug, 3 pg, 2 M g, or 1 pg polypeptide per ml 
suspension. 
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The property mentions as an example the mouse strain G57BI/6] as the animal model. As 
will be known by a person skilled in the art, due to genetic variation, different strains may 
react with immune responses of varying strength to the same polypeptide. It is presently 
unknown which strains of mice will give the best predictability of immunogenic reactivity 
5 in which human population. Therefore, it is important to test other mouse strains, such as 
C3H/HeN, CBA (preferably CBA/J), DBA (preferably DBA/2 J), A/J, AKR/N, DBA/1 J, 
FVB/N, SJL/N, 129/SvJ, C3H/HeJ-Lps or BALB mice (preferably BALB/cA, BALB/cJ). It is 
presently contemplated that also a similar test performed in another animal model such 
as a guinea pig or a rat will have clinical predictability. In order to obtain good clinical 
10 predictability to humans, it is contemplated that any farm animal, such as a cow, pig, or 
deer, or any primate will have clinical predictability and thus serve as an animal model. 

It should be noted, moreover, that tuberculosis disease also affects a number of different 
animal species such as cows, primates, guinea pigs, badgers, possums, and deers. A 
15 polypeptide which has proven effective in any of the models mentioned above may be of 
interest for animal treatment even if it is not effective in a human being. 

It is proposed to measure the release of IFN-y from reactivated T lymphocytes withdrawn 
from a C57BI/6j mouse within 4 days after the mouse has been rechallenged with virulent 

20 Mycobacteria. This is due to the fact that when an immune host mounts a protective 
immune response, the specific T-cells responsible for the early recognition of the infected 
macrophage stimulate a powerful bactericidal activity through their production of IFN-y 
(Rook, G.A.W. (1990) Res. Microbiol. 141:253-256; Flesch, I. et S.H.E. Kaufmann ( 1987) 
J Immunol. 1 38(1 2):4408-1 3). However other cytokines could be relevant when 

25 monitoring the immunological response to the polypeptide, such as IL-12, TNF-a, IL-4, IL- . 
5, IL-10, IL-6, TGF-(3. Usually one or more cytokines will be measured utilising for 
example the PCR technique or ELISA. It will be appreciated by the person skilled in the 
art that a significant increase or decrease in the amount of any of these cytokines 
induced by a specific polypeptide can be used in evaluation of the immunological efficacy 

30 of the polypeptide. The ability of a polypeptide to induce a IFN-y response is presently 
believed to be the most relevant correlate of protective immunity as mice with a disruption 
of the gene coding for IFN-y are unable to control a mycobacterial infection and die very 
rapidly with widespread dissemination, caseous necrosis and large abscesses (Flynn et 
al (1993) J.Exp.Med 178: 2249-2254, Cooper et al (1993) J.Exp.Med. 178:2243-2248). A 

35 specific model for obtaining information regarding the antigenic targets of a protective 
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immunity in the memory model was originally developed by Lefford (Lefford et al (1973) 
Immunology 25:703) and has been used extensively in the recent years (Orme et al 
(1988). Infect.lmmun. 140:3589, P.Andersen and I. Heron (1993) J.lmmunol.154:3359). 

5 The property described in ii) will also be satisfied if the release of IFN-y from T- 

lymphocytes withdrawn during primary infection is 2,000 pg/ml, such as 3,000 pg/ml. The 
comments on property i) regarding a significant increase in IFN-y, concentration of 
polypeptide, animal model, and other cytokines are equally relevant to property ii), and 
wee versa. 

10 

The property described in iii) will also be satisfied if the protective immunity is determined 
by challenging the mouse more than 6 weeks after the last vaccination challenge such as 
7 weeks, preferably 8 weeks, 9 weeks, 10 weeks, 1 1 weeks, 12 weeks or 15 weeks. In 
one embodiment of the invention the bacteria are recovered from the spleen more than 6 

15 weeks after the last vaccination challenge such as 7 weeks, preferably 8 weeks, 9 weeks, 
10 weeks, 1 1 weeks, 12 weeks or 15 weeks. In another embodiment of the invention, the 
last vaccination challenge is given subcutaneously with 5x1 0 4 virulent Mycobacteria. As 
will be known by the person skilled in the art, the number of viable bacteria in the lung is 
presently considered to be relevant to the degree of bacterial infection of the animal. An 

20 equally important measure is the determination of the number of viable bacteria in the 
spleen, lymph node, or blood. 

The amount of polypeptide and adjuvant used for vaccinating will depend on the animal 
model used, e.g. the mouse strain. When a mouse model is used it is preferred that the 
25 amount of polypeptide used for vaccinating the mouse is between 2 and 20 ^ig, such as 
between 5 and 15 ^ig, preferably 10 \xg. For larger animals such as guinea pigs, deers, 
cows, primates, badgers, and possums higher doses such as 5 to 50 ng of a single 
polypeptide are preferred. 

30 The comments on property i) regarding concentration of polypeptide and animal model 
are equally relevant to property iii), and vice versa. 



35 



In another aspect of property iii), the mice, or other animal model, are given the standard 
lethal dose of virulent Mycobacteria. The standard lethal dose varies from around 3x10 s 
to around 5x1 0 6 virulent Mycobacteria depending on the specific strain of virulent 
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Mycobacteria and strain of mice. The mortality in the mice is then monitored and 
compared to a placebo vaccinated control group. A significant decrease in mortality, 
measured as the mean survival time, will be indicative of an immunologically effective 
polypeptide. In a very recent paper it is shown that there is good correlation between 
5 mortality of the individual animals and the bacterial counts in the same animals. 
(S.Baldwin (1998) Infect.lmmun 66:2951-2959). 

The property described in iv) will also be satisfied if the release of IFN-y from PBMC is 
determined in PBMC withdrawn from TB patients or PPD positive individuals more than 6 
10 months after diagnosis such as 9 months, 1 year, 2 years, 5 years, or 10 years after 
diagnosis. 

The comments on property i) regarding significant increase in IFN-y, concentration of 
polypeptide, and other cytokines are equally relevant to property iv). 

15 

The property described in v) will in particular be satisfied, if the ELISA is performed as 
follows: the polypeptide of interest in the concentration of 1 to 10 ^g/ml is coated on a 96 
wells polystyrene plate (NUNC, Denmark) and after a washing step with phosphate buffer 
pH 7.3, containing 0.37 M NaCI and 0.5% Tween-20 the serum or plasma from a TB 
20 patient is applied in dilution's from 1:10 to 1 :1000 in PBS with 1% Tween-20. Binding of 
an antibody to the polypeptide is determined by addition of a labeled (e.g. peroxidase 
labeled) secondary antibody and reaction is thereafter visualized by the use of OPD and 
H 2 0 2 as described by the manufacturer (DAKO, Denmark). The OD value in each well is 
determined using an appropriate ELISA reader. 

25 

In a preferred embodiment the western blot is performed as follows: The polypeptide is 
applied in concentrations from 1-40 ng to a SDS-PAGE and after electrophoresis the 
polypeptide is transferred to a membrane e.g. nitrocellulose or PVDF. The membrane is 
thereafter washed in phosphate buffer, pH 7.3, containing 0.37 M NaCI and 0.5% Tween- 
30 20 for 30 min. The sera obtained from one or more TB patients were diluted 1 : 10 to 
1:1000 in phosphate buffer pH 7.3 containing 0.37 M NaCI: The membrane is hereafter 
washed four times five minutes in binding buffer and incubated with peroxidase- or 
phosphates-labeled secondary antibody. Reaction is then visualized using the staining 
method recommended by the manufacture (DAKO, Denmark). 
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The property described in vi) will in particular be satisfied if the polypeptide does not 
induce such an IFN-y release in an individual not infected with a virulent Mycobacterium, 
i.e. an individual who has been BCG vaccinated or infected with Mycobacterium avium or 
sensitised by non-tuberculosis Mycobacterium (NTM). The comments on property i) 
5 regarding significant increase in IFN-y, concentration of polypeptide, and other cytokines 
are equally relevant to property vi). 

The property described in vii) will in particular be satisfied if the polypeptide does not 
induce such an IFN-y release in an individual cleared of an infection with a virulent 
10 Mycobacterium, i.e. which does not have any positive culture, microscopically or clinically 
proven ongoing infection with virulent Mycobacterium. The comments on property i) 
regarding significant increase in IFN-y, concentration of polypeptide, and other cytokines 
are equally relevant to property vii). 

15 The property described in viii) will in particular be satisfied if the polypeptide does not 
induce such a response in an individual not infected with a virulent Mycobacterium, i.e. 
an individual who has been BCG vaccinated or infected with Mycobacterium avium or 
sensitised by non-tuberculosis Mycobacterium. In a preferred embodiment the amount of 
polypeptide intradermal^ injected or applied is 90 jkj, such as 80^g, 70 |ag, 60 jig, 

20 50 fxg, 40 jxg, or 30 jag. In another embodiment of the invention, the diameter of the 
positive response is at least 11 mm, such as 12 mm, 13 mm, 14 mm, or 15 mm. In a 
preferred embodiment the induration of erythema or both could be determined after 
administration of the polypeptide by intradermal injection, patch test or multipuncture. The 
reaction diameter could be positive after more than 48, such as 72 or 96 hours. 

25 

The property described in ix) will in particular be satisfied if the polypeptide does not 
induce such a response in an individual cleared of an infection with a virulent 
Mycobacterium, i.e. which does not have any positive culture or microscopically proven 
ongoing infection with virulent Mycobacterium. The comments on property viii) regarding 
30 the amount of polypeptide intradermal^ injected or applied and the diameter of the 
positive response are equally relevant to property ix). 

Preferred embodiments of the invention are the specific polypeptides which have been 
identified and analogues and subsequences thereof. It has been noted that none of the 
35 identified polypeptides in the examples include a signal sequence. 
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Until the present invention was made, it was unknown that the polypeptides with the 
amino acid sequences disclosed in SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28, 30, 32, 34, 36, 38, 75, 77 and 79 are expressed in live virulent Mycobacterium. 
5 These polypeptides in purified form, or non-naturally occurring, i.e. recombinantly or 
synthetically produced, are considered part of the invention. It is understood that a 
polypeptide which has any of the properties i) - ix) and has a sequence identity of at least 
80% with any of the amino acid sequences shown in SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 
16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 75, 77 and 79 or has a sequence identity of 

10 at least 80% to any subsequence thereof is considered part of the invention. In a 
preferred embodiment the sequence identity is at least 80%, such as 81%, 82%, 83%, 
84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 
99% or 99.5%. Furthermore, any T cell epitope of the polypeptides disclosed in SEQ ID 
NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 75, 77 and 79 is 

15 considered part of the invention. Also, any B-cell epitope of the polypeptides disclosed in 
SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 75, 77 
and 79 is considered part of the invention. 

Although the minimum length of a T-cell epitope has been shown to be at least 6 amino 
20 acids, it is normal that such epitopes are constituted of longer stretches of amino acids. 
Hence it is preferred that the polypeptide fragment of the invention has a length of at 
least 7 amino acid residues, such as at least 8, at least 9, at least 10, at least 12, at least 
14, at least 16, at least 18, at least 20, at least 22, at least 24, or at least 30 amino acid 
residues. 

25 

In both immunodiagnostics and vaccine preparation, it is often possible and practical to 
prepare antigens from segments of a known immunogenic protein or polypeptide. Certain 
epitopic regions may be used to produce responses similar to those produced by the 
entire antigenic polypeptide. Potential antigenic or immunogenic regions may be 

30 identified by any of a number of approaches, e.g., Jameson-Wolf or Kyte-Doolittle 
antigenicity analyses or Hopp and Woods (Hopp et Woods, (1981), Proc Natl Acad Sci 
USA 78/6:3824-8) hydrophobicity analysis (see, e.g., Jameson and Wolf, (1988) Comput 
Appl Biosci, 4(1):181-6; Kyte and Doolittle, (1982) J Mol Biol, 157(1): 105-32; or U.S. 
Patent No. 4,554,101). Hydrophobicity analysis assigns average hydrophilicity values to 

35 each amino acid residue; from these values average hydrophilicities can be calculated 
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and regions of greatest hydrophilicity determined. Using one or more of these methods, 
regions of predicted antigenicity may be derived from the amino acid sequence assigned 
to the polypeptides of the invention. Alternatively, in order to identify relevant T-cell 
epitopes which are recognised during an immune response, it is also possible to use a 
5 "brute force" method: Since T-cell epitopes are linear, deletion mutants of polypeptides 
will, if constructed systematically, reveal what regions of the polypeptide are essential in 
immune recognition, e.g. by subjecting these deletion mutants to the IFN-y assay 
described herein. A presently preferred method utilises overlapping oligomers (preferably 
synthetic ones having a length of e.g. 20 amino acid residues) derived from the 
10 polypeptide. Some of these will give a positive response in the IFN-y assay whereas 
others will not. A preferred T-cell epitope is a T-helper cell epitope or a cytotoxic T-cell 
epitope. 

B-cell epitopes may be linear or spatial. The three-dimensional structure of a protein is 
15 often such that amino acids, which are located distant from each other in the one- 
dimensional structure, are located near to each other in the folded protein. Within the 
meaning of the present context, the expression epitope is intended to comprise the one- 
and three-dimensional structure as well as mimics thereof. The term is further intended to 
include discontinuous B-cell epitopes. The linear B-cell epitopes can be identified in a 
20 similar manner as described for the T-cell epitopes above. However, when Identifying B- 
cell epitopes the assay should be an ELISA using overlapping oligomers derived from the 
polypeptide as the coating layer on a microtiter plate as described elsewhere. 

A non-naturally occurring polypeptide, an analogue, a subsequence, a T-cell epitope 
25 and/or a B-cell epitope of any of the described polypeptides are defined as any non- 
naturally occurring polypeptide, analogue, subsequence, T-cell epitope and/or B-cell 
epitope of any of the polypeptides having any of the properties i)-ix). 

Table 1 lists the antigens of the invention. 



30 
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Table 1 The antigens of the invention by the names used herein as well as by reference 
to relevant SEQ ID NOs of N-terminal sequences, full amino acid sequences and 
sequences of nucleotides encoding the antigens 



Antigen 


N-Terminal sequence 
SEQ ID NO: 


Nucleotide 
sequence 
SEQ ID NO: 


Ammo aciu sequence 
SEQ ID NO: 


TB10C 


45 


1 


2 


TB13A 


50 


3 


4 


TB15 


39 


5 


6 


TB15A 


46 


7 


8 


TB17 


47 


9 


10 


TB18 


40 


11 


12 


TB21 


41 


13 


14 


TB24 


48 


15 


16 


TB27B 


49 


17 


18 


TB33 


42 


19 


20 


TB38 


43 


21 


22 


TB54 


44 


23 


24 


TB64 


57 


25 


26 


TB11B 


51 


27 


28 


TB16 


52 


29 


30 


TB16A 


53 


31 


32 


TB32 


54 


33 


34 


TB32A 


55 


35 


36 


TB51 


56 


37 


38 


TB12.5 


80 


74 


75 


TB20.6 


81 


76 


77 


TB40.8 


82 


78 


79 



5 Each of the polypeptides may be characterised by specific amino acid and nucleic acid 
sequences. It will be understood that such sequences include analogues and variants 
produced by recombinant methods wherein such nucleic acid and polypeptide sequences 
have been modified by substitution, insertion, addition and/or deletion of one or more 
nucleotides in said nucleic acid sequences to cause the substitution, insertion, addition or 
10 deletion of one or more amino acid residues in the recombinant polypeptide. A preferred 
nucleotide sequence encoding a polypeptide of the invention is a nucleotide sequence 
which 
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1) is a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1, 3, 5, 7, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 74, 76 and 78 or an analogue of 
said sequence which hybridises with any of the nucleotide sequences shown in SEQ ID 
NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 74, 76 or 78 or a 
5 nucleotide sequence complementary thereto, or a specific part thereof, preferably under 
stringent hybridisation conditions. By stringent conditions is understood, as defined in the 
art, 5-1 0°C under the melting point T m , cf. Sambrook et al, 1989, pages 11.45-11.49, 
and/or 

10 2) encodes a polypeptide, the amino acid sequence of which has a 80% sequence 

identity with an amino acid sequence selected from the group consisting of SEQ ID NOs: 
2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 75, 77 and 79 and/or 

3) constitutes a subsequence of any of the above mentioned nucleotide sequences, 
15 and/or 

4) constitutes a subsequence of any of the above mentioned polypeptide sequences. 

The terms "analogue" or "subsequence" when used in connection with the nucleotide 
20 fragments of the invention are thus intended to indicate a nucleotide sequence which 
encodes a polypeptide exhibiting identical or substantially identical immunological 
properties to a polypeptide encoded by the nucleotide fragment of the invention shown in 
any of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 74, 
76 or 78, allowing for minor variations which do not have an adverse effect on the ligand 
25 binding properties and/or biological function and/or immunogenicity as compared to any 
of the polypeptides of the invention or which give interesting and useful novel binding 
properties or biological functions and immunogenicities etc. of the analogue and/or 
subsequence. The analogous nucleotide fragment or nucleotide sequence may be 
derived from a bacterium, a mammal, or a human or may be partially or completely of 
30 synthetic origin. The analogue and/or subsequence may also be derived through the use 
of recombinant nucleotide techniques. 

Furthermore, the terms "analogue" and "subsequence" are intended to allow for 
variations in the sequence such as substitution, insertion (including introns), addition, 
35 deletion and rearrangement of one or more nucleotides, which variations do not have any 
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substantial effect on the polypeptide encoded by a nucleotide fragment or a subsequence 
thereof. The term "substitution" is intended to mean the replacement of one or more 
nucleotides in the full nucleotide sequence with one or more different nucleotides, 
"addition" is understood to mean the addition of one or more nucleotides at either end of 
5 the full nucleotide sequence, "insertion" is intended to mean the introduction of one or 
more nucleotides within the full nucleotide sequence, "deletion" is intended to indicate 
that one or more nucleotides have been deleted from the full nucleotide sequence 
whether at either end of the sequence or at any suitable point within it, and "re- 
arrangement" is intended to mean that two or more nucleotide residues have been 
10 exchanged with each other. 

It is well known that the same amino acid may be encoded by various codons, the codon 
usage being related, inter alia, to the preference of the organisms in question expressing 
the nucleotide sequence. Thus, at least one nucleotide or codon of a nucleotide fragment 
15 of the invention may be exchanged by others which; when expressed, results in a 
polypeptide identical or substantially identical to the polypeptide encoded by the 
nucleotide fragment in question. 

The term "subsequence" when used in connection with the nucleic acid fragments of the 
20 invention is intended to indicate a continuous stretch of at least 10 nucleotides which ex- 
hibits the above hybridization pattern. Normally this will require a minimum sequence 
identity of at least 70% with a subsequence of the hybridization partner having SEQ ID 
NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 74, 76 or 78. It is 
preferred that the nucleic acid fragment is longer than 10 nucleotides, such as at least 15, 
25 at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 
55, at least 60, at least 65, at least 70, and at least 80 nucleotides long, and the 
sequence identity should preferable also be higher than 70%, such as at least 75%, at 
least 80%, at least 85%, at least 90%, at least 92%, at least 94%, at least 96%, and at 
least 98%. It is most preferred that the sequence identity is 100%. Such fragments may 
30 be readily prepared by, for example, directly synthesizing the fragment by chemical 
means, by application of nucleic acid reproduction technology, such as the PCR tech- 
nology of U.S. Patent 4,603,102, or by introducing selected sequences into recombinant 
vectors for recombinant production. 
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The nucleotide sequence to be modified may be of cDNA or genomic origin as discussed 
above, but may also be of synthetic origin. Furthermore, the sequence may be of mixed 
cDNA and genomic, mixed cDNA and synthetic or genomic and synthetic origin as 
discussed above. The sequence may have been modified, e.g. by site-directed mu- 
5 tagenesis, to result in the desired nucleic acid fragment encoding the desired polypep- 
tide. 

The invention also relates to a replicable expression vector which comprises a nucleic 
acid fragment defined above, especially a vector which comprises a nucleic acid frag- 

10 ment encoding a polypeptide fragment of the invention. The vector may be any vector 
which may conveniently be subjected to recombinant DNA procedures, and the choice of 
vector will often depend on the host cell into which it is to be introduced. Thus, the vector 
may be an autonomously replicating vector, i.e. a vector which exists as an extrachromo- 
somal entity, the replication of which is independent of chromosomal replication; 

15 examples of such a vector are a plasmid, phage, cosrnid, mini-chromosome and virus. 
Alternatively, the vector may be one which, when introduced in a host cell, is integrated in 
the host cell genome and replicated together with the chromosome(s) into which it has 
been integrated. 

20 Expression vectors may be constructed to include any of the DNA segments disclosed 
herein. Such DNA might encode an antigenic protein specific for virulent strains of 
mycobacteria or even hybridization probes for detecting mycobacteria nucleic acids in 
samples. Longer or shorter DNA segments could be used, depending on the antigenic 
protein desired. Epitopic regions of the proteins expressed or encoded by the disclosed 

25 DNA could be included as relatively short segments of DNA. A wide variety of expression 
vectors is possible including, for example, DNA segments encoding reporter gene 
products useful for identification of heterologous gene products and/or resistance genes 
such as antibiotic resistance genes which may be useful in identifying transformed cells. 

30 The vector of the invention may be used to transform cells so as to allow propagation of 
the nucleic acid fragments of the invention or so as to allow expression of the polypeptide 
fragments of the invention. Hence, the invention also pertains to a transformed cell 
harbouring at least one such vector according to the invention, said cell being one which 
does not natively harbour the vector and/or the nucleic acid fragment of the invention 

35 contained therein. Such a transformed cell (which is also a part of the invention) may be 
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any suitable bacterial host cell or any other type of cell such as a unicellular eukaryotic 
organism, a fungus or yeast, or a cell derived from a multicellular organism, e.g. an ani- 
mal or. a plant. It is especially in cases where glycosylation is desired that a mammalian 
cell is used, although glycosylation of proteins is a rare event in prokaryotes. Normally, 
5 however, a prokaryotic cell is preferred such as a bacterium belonging to the genera 
Mycobacterium, Salmonella, Pseudomonas, Bacillus and Eschericia. It is preferred that 
the transformed cell is an E coli f B. subtilis, or M. bovis BCG cell, and it is especially 
preferred that the transformed cell expresses a polypeptide according of the invention. 
The latter opens for the possibility to produce the polypeptide of the invention by simply 
10 recovering it from the culture containing the transformed cell. In the most preferred 
embodiment of this part of the invention the transformed cell is Mycobacterium bovis 
BCG strain: Danish 1331, which is the Mycobacterium bovis strain Copenhagen from the 
Copenhagen BCG Laboratory, Statens Seruminstitut, Denmark. 

15 The nucleic acid fragments of the invention allow for the recombinant production of the 
polypeptides fragments of the invention. However, also isolation from the natural source 
is a way of providing the polypeptide fragments as is peptide synthesis. 

Therefore, the invention also pertains to a method for the preparation of a polypeptide 
20 fragment of the invention, said method comprising inserting a nucleic acid fragment as 
described in the present application into a vector which is able to replicate in a host cell, 
introducing the resulting recombinant vector into the host cell (transformed cells may be 
selected using various techniques, including screening by differential hybridization, 
identification of fused reporter gene products, resistance markers, anti-antigen antibodies 
25 and the like), culturing the host cell in a culture medium under conditions sufficient to 
effect expression of the polypeptide (of course the cell may be cultivated under conditions 
appropriate to the circumstances; and if DNA is desired, replication conditions are used), 
and recovering the polypeptide from the host cell or culture medium; or 

30 isolating the polypeptide from a short-term culture filtrate; or 

isolating the polypeptide from whole mycobacteria of the tuberculosis complex or from 
lysates or fractions thereof, e.g. cell wall containing fractions, or 

35 synthesizing the polypeptide by solid or liquid phase peptide synthesis. 
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The medium used to grow the transformed cells may be any conventional medium 
suitable for the purpose. A suitable vector may be any of the vectors described above, 
and an appropriate host cell may be any of the cell types listed above. The methods 
5 employed to construct the vector and effect introduction thereof into the host cell may be 
any methods known for such purposes within the field of recombinant DNA. In the follow- 
ing a more detailed description of the possibilities will be given: 

In general, of course, prokaryotes are preferred for the initial cloning of nucleic se- 
10 quences of the invention and constructing the vectors useful in the invention. For ex- 
ample, in addition to the particular strains mentioned in the more specific disclosure 
below, one may mention by way of example, strains such as £. coli K12 strain 294 
(ATCC No. 31446), £ coli B, and £ coliX 1776 (ATCC No. 31537). These examples 
are, of course, intended to be illustrative and not limiting. 

15 

Prokaryotes are also preferred for expression. The aforementioned strains, as well as £ 
coli W31 10 (F-, lambda-, prototrophic, ATCC No. 273325), bacilli such as Bacillus 
subtilis, or other enterobacteriaceae such as Salmonella typhimurium or Serratia mar- 
cesans, and various Pseudomonas species may be used. Especially interesting are 
20 rapid-growing mycobacteria, e.g. M. smegmatis, as these bacteria have a high degree of 
resemblance with mycobacteria of the tuberculosis complex and therefore stand a good 
chance of reducing the need of performing post-translational modifications of the 
expression product. 

25 In general, plasmid vectors containing replicon and control sequences which are derived 
from species compatible with the host cell are used in connection with these hosts. The 
vector ordinarily carries a replication site, as well as marking sequences which are 
capable of providing phenotypic selection in transformed cells. For example, £ co//is 
typically transformed using pBR322, a plasmid derived from an £ coli species (see, e.g., 

30 Bolivar et al., 1977, Gene 2: 95). The pBR322 plasmid contains genes for ampiciilin and 
tetracycline resistance and thus provides easy means for identifying transformed cells. 
The pBR plasmid, or other microbial plasmids or phages must also contain, or be 
modified to contain, promoters which can be used by the microorganism for expression. 
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Those promoters most commonly used in recombinant DNA construction include the B- 
lactamase (penicillinase) and lactose promoter systems (Chang et al., (1978), Nature, 
35:515; Itakura et al., (1977), Science 198:1056; Goeddel et al., (1979), Nature 281:544) 
and a tryptophan (trp) promoter system (Goeddel et al., (1979) Nature 281:544; EPO 
5 AppL Publ. No. 0036776). While these are the most commonly used, other microbial 
promoters have been discovered and utilized, and details concerning their nucleotide 
sequences have been published, enabling a skilled worker to ligate them functionally with 
plasmid vectors (Siebwenlist et al., (1980), Cell, 20:269). Certain genes from prokaryotes 
may be expressed efficiently in E. coli from their own promoter sequences, precluding the 
1 0 need for addition of another promoter by artificial means. 

After the recombinant preparation of the polypeptide according to the invention, the 
isolation of the polypeptide may for instance be carried out by affinity chromatography (or 
other conventional biochemical procedures based on chromatography), using a 
15 monoclonal antibody which substantially specifically binds the polypeptide according to 
the invention. Another possibility is to employ the simultaneous electroelution technique 
described by Andersen et al. in J. Immunol. Methods 161: 29-39. 

s 

According to the invention the post-translational modifications involves lipidation, gly- 
20 cosylation, cleavage, or elongation of the polypeptide. 

In certain aspects, the DNA sequence information provided by this invention allows for 
the preparation of relatively short DNA (or RNA or PNA) sequences having the ability to 
specifically hybridize to mycobacterial gene sequences. In these aspects, nucleic acid 

25 probes of an appropriate length are prepared based on a consideration of the relevant 
sequence. The ability of such nucleic acid probes to specifically hybridize to the 
mycobacterial gene sequences lend them particular utility in a variety of embodiments. 
Most importantly, the probes can be used in a variety of diagnostic assays for detecting 
the presence of pathogenic organisms in a given sample. However, either uses are 

30 envisioned, including the use of the sequence information for the preparation of mutant 
species primers, or primers for use in preparing other genetic constructs. 

Apart from their use as starting points for the synthesis of polypeptides of the invention 
and for hybridization probes (useful for direct hybridization assays or as primers in e.g. 
35 PCR or other molecular amplification methods) the nucleic acid fragments of the 
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invention may be used for effecting in vivo expression of antigens, i.e. the nucleic acid 
fragments may be used in so-called DNA vaccines. Recent research have revealed that a 
DNA fragment cloned in a vector which is non-replicative in eukaryotic cells may be 
introduced into an animal (including a human being) by e.g. intramuscular injection or 
5 percutaneous administration (the so-called "gene gun" approach). The DNA is taken up 
by e.g. muscle cells and the gene of interest is expressed by a promoter which is func- 
tioning in eukaryotes, e.g. a viral promoter, and the gene product thereafter stimulates 
the immune system. These newly discovered methods are reviewed in Ulmer et al., 
(1993), Curr. Opin. Invest. Drugs, 2:983-989 which hereby is included by reference. 



10 



Hence, the invention also relates to a vaccine comprising a nucleic acid fragment ac- 
cording to the invention, the vaccine effecting in vivo expression of antigen by an animal, 
including a human being, to whom the vaccine has been administered, the amount of 
expressed antigen being effective to confer substantially increased resistance to infec- 
15 tions with mycobacteria of the tuberculosis complex in an animal, including a human 
being. 

The efficacy of such a "DNA vaccine" can possibly be enhanced by administering the 
gene encoding the expression product together with a DNA fragment encoding a poly- 

20 peptide which has the capability of modulating an immune response. For instance, a 
gene encoding lymphokine precursors or lymphokines (e.g. IFN-y, IL-2, or IL-12) could be 
administered together with the gene encoding the immunogenic protein, either by ad- 
ministering two separate DNA fragments or by administering both DNA fragments 
included in the same vector. It also is a possibility to administer DNA fragments compri- 

25 sing a multitude of nucleotide sequences which each encode relevant epitopes of the 
polypeptides disclosed herein so as to effect a continuous sensitization of the immune 
system with a broad spectrum of these epitopes. 

In one embodiment of the invention, any of the above mentioned polypeptides is used in 
30 the manufacture of an immunogenic composition to be used for induction of an immune 
response in a mammal against an infection with a virulent Mycobacterium. Preferably, the 
immunogenic composition is used as a vaccine. 

The preparation of vaccines which contain peptide sequences as active ingredients is 
35 generally well understood in the art, as exemplified by U.S. Patents 4,608,251; 
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4,601,903; 4,599,231; 4,599,230; 4,596,792; and 4,578,770, all incorporated herein by 
re f erenC e. Typically, such vaccines are prepared as injectables either as liquid solutions 
or suspensions; solid forms suitable for solution in liquid or suspension in liquid prior to 
injection may also be prepared. The preparation may also be emulsified. The active 
5 immunogenic ingredient is often mixed with excipients which are pharmaceutical^ 
acceptable and compatible with the active ingredient. Suitable excipients are, for 
example, water, saline, dextrose, glycerol, ethanol, or the like, and combinations thereof. 
In addition, if desired, the vaccine may contain minor amounts of auxiliary substances 
such as wetting or emulsifying agents, pH buffering agents, or adjuvants which enhance 
10 the effectiveness of the vaccines. 

In one embodiment the composition used for vaccination comprises at least one, but 
preferably at least 2, such as at least 3, 4, 5, 10, 15 or at least 20 different polypeptides 
of the invention. 

15 

In another embodiment the composition to be used for vaccine comprises, together with 
at least one polypeptide of the invention, at least one, but preferably at least 2, such as at 
least 3, 4, 5, 10, 15 or at least 20 polypeptides which are not polypeptides of the present 
invention but are derived from a virulent Mycobacterium such as a polypeptide belonging 
20 to the group of ST-CF (Elhay MJ and Andersen P, Immunology and cell Biology (1997) 
75, 595-603). ESAT-6, CFP7, CFP10 (EMBL accession number: AL022120), CFP17, 
CFP21, CFP25, CFP29, MPB59, MPT59, MPB64, and MPT64. 

The vaccines are conventionally administered parenterally, by injection, for example, 
25 either subcutaneously or intramuscularly. Additional formulations which are suitable for 
other modes of administration include suppositories and, in some cases, oral 
formulations. For suppositories, traditional binders and carriers may include, for example, 
polyalkalene glycols or triglycerides; such suppositories may be formed from mixtures 
containing the active ingredient in the range of 0.5% to 10%, preferably 1-2%. Oral 
30 formulations include such normally employed excipients as, for example, pharmaceutical 
grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, 
magnesium carbonate, and the like. These compositions take the form of solutions, 
suspensions, tablets, pills, capsules, sustained release formulations or powders and 
contain 10-95% of active ingredient, preferably 25-70%. 
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The proteins may be formulated into the vaccine as neutral or salt forms. 
Pharmaceutical^ acceptable salts include acid addition salts (formed with the free amino 
groups of the peptide) and which are formed with inorganic acids such as, for example, 
hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, 
5 mandelic, and the like. Salts formed with the free carboxyl groups may also be derived 
from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or 
ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 2- 
ethylamino ethanol, histidine, procaine, and the like. 

10 The vaccines are administered in a manner compatible with the dosage formulation, and 
in such amount as will be therapeutically effective and immunogenic. The quantity to be 
administered depends on the subject to be treated, including, e.g., the capacity of the 
individual's immune system to mount an immune response, and the degree of protection 
desired. Suitable dosage ranges are of the order of several hundred micrograms of active 

15 ingredient per vaccination with a preferred range from about 0.1 ng to 1000 jxg f such as 
in the range from about 1 fig to 300 ^ig, and especially in the range from about 10 ng to 
50 ^ig. Suitable regimes for initial administration and booster shots are also variable but 
are typified by an initial administration followed by subsequent inoculations or other 
administrations. 

20 

The manner of application may be varied widely. Any of the conventional methods for 
administration of a vaccine are applicable. Preferred routes of administration are the 
parenteral route such as the intravenous, intraperitoneal, intramuscular, subcutaneous or 
intradermal routes; the oral (on a solid physiologically acceptable base or in a physiologi- 
25 caily acceptable dispersion), buccal, sublingual, nasal, rectal or transdermal routes. The 
dosage of the vaccine will depend on the route of administration and will vary according 
to the age of the person to be vaccinated and, to a lesser degree, the weight of the 
person to be vaccinated. 

30 Some of the polypeptides of the vaccine are sufficiently immunogenic in a vaccine, but for 
some of the others the immune response will be enhanced if the vaccine further 
comprises an adjuvant substance. 

Various methods of achieving adjuvant effect for the vaccine include use of agents such 
35 as aluminum hydroxide or phosphate (alum), commonly used as a 0.05 to 0.1 percent 
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solution in phosphate buffered saline, admixture with synthetic polymers of sugars 
(Carbopol) used as a 0.25 percent solution, aggregation of the protein in the vaccine by 
heat treatment with temperatures ranging between 70° to 101 °C for 30 second to 2 
minute periods respectively. Aggregation by reactivating with pepsin treated (Fab) 
5 antibodies to albumin, mixture with bacterial cells such as C. parvum or endotoxins or 
lipopolysaccharide components of gram-negative bacteria, emulsion in physiologically 
acceptable oil vehicles such as mannide mono-oleate (Aracel A) or emulsion with 20 
percent solution of a perfluorocarbon (Fluosol-DA) used as a block substitute may also 
be employed. According to the invention DDA (dimethyldioctadecylammonium bromide) 
10 is an interesting candidate for an adjuvant, but also Freund's complete and incomplete 
adjuvants as well as QuilA and RIBI adjuvants are interesting possibilities. 

Other possibilities to enhance the immunogenic effect involve the use of immune 
modulating substances such as lymphokines (e.g. IFN-y, IL-2 and IL-12) or synthetic IFN- 
15 y inducers such as poly l:C in combination with the above-mentioned adjuvants. 

In many instances, it will be necessary to have multiple administrations of the vaccine, 
usually not exceeding six vaccinations, more usually not exceeding four vaccinations and 
preferably one or more, usually at least about three vaccinations. The vaccinations will 
20 normally be at from two to twelve week intervals, more usually from three to five week 
intervals. Periodic boosters at intervals of 1-25 years, such as 20 years, preferably 15 or 
10 years, more preferably 1-5 years usually three years, will be desirable to maintain the 
desired levels of protective immunity. 

25 In one embodiment of the invention a composition is produced comprising as the 
effective component a micro-organism, the micro-organism is a bacterium such as 
Mycobacterium, Salmonella, Pseudomonas and Escherichia, preferably Mycobacterium 
bovis BCG wherein at least one, such as at least 2 copies, such as at least 5 copies of a 
nucleotide fragment comprising a nucleotide sequence encoding a polypeptide of the 

30 invention has been incorporated into the genome of the micro-organism or introduced as 
a part of an expression vector in a manner allowing the micro-organism to express and 
optionally secrete the polypeptide. In a preferred embodiment, the composition comprises 
at least 2 different nucleotide sequences encoding at least 2 different polypeptides of the 
invention. In a much preferred embodiment, the composition comprises at least 2 

35 different nucleotide sequences encoding at least one polypeptide of the invention and at 
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least one polypeptide belonging to the group of ST-CF (Elhay MJ and Andersen P, 
Immunology and cell Biology (1997) 75, 595-603) such as ESAT-6, CFP7, CFP10, 
CFP17, CFP21, CFP25, CFP29, MPB59, MPT59, MPB64, and MPT64. 

5 Individuals infected with virulent Mycobacteria can generally be divided into two groups. 
The first group has an infection with a virulent Mycobacterium e.g. contacts of TB 
patients. The virulent Mycobacterium may have established colonies in the lungs, but the 
individual has, as yet, no symptoms of TB. The second group has clinical symptoms of 
TB, as a TB patient. 

10 

In one embodiment of the invention, any of the above mentioned polypeptides are used 
for the manufacture of a diagnostic reagent that preferably distinguishes a subclinical^ or 
clinically infected individual (group I and group II) from an individual who has been BCG 
vaccinated or infected with Mycobacterium avium or sensitised by non-tuberculosis 
15 Mycobacterium (NTM), and may distinguish a subclinical^ or clinically infected individual 
from an individual who has cleared a previous infection with a virulent Mycobacterium. It 
is most likely that specific polypeptides derived from SPE will identify group I and/or 
group II from individuals not infected with virulent Mycobacteria in the same way as 
ESAT-6 and CFP10 (P.Ravn et al., (1998), J. Infectious Disease 179:637-45). 

20 

In one embodiment of the invention, any of the above discussed polypeptides are used 
for the manufacture of a diagnostic reagent for the diagnosis of an infection with a virulent 
Mycobacterium. One embodiment of the invention provides a diagnostic reagent for 
differentiating an individual who is clinically or subclinical^ infected with a virulent 

25 Mycobacterium from an individual not infected with virulent Mycobacterium, i.e. an 
individual who has been BCG vaccinated or infected with Mycobacterium avium or 
sensitised by non-tuberculosis Mycobacterium (NTM). Such a diagnostic reagent will 
distinguish between an individual in group I and/or II of the infection stages above, from 
an individual who has been vaccinated against TB. Another embodiment of the invention 

30 provides a diagnostic reagent for differentiating an individual who is clinically or 

subclinical^ infected with a virulent Mycobacterium from an individual who has a cleared 
infection with a virulent Mycobacterium. Such a diagnostic reagent will distinguish 
between an individual in group I and/or II of the infection stages above, from an individual 
who has cleared the infection. 
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Determination of an infection with virulent Mycobacterium will be instrumental in the, still 
very laborious, diagnostic process of tuberculosis. A number of possible diagnostic 
assays and methods can be envisaged (some more specifically described in the 
examples and the list of properties): a sample comprising whole blood or mononuclear 
5 cells {La. T-lymphocytes) from a patient could be contacted with a sample of one or more 
polypeptides of the invention. This contacting can be performed in vitro and a positive 
reaction could e.g. be proliferation of the T-cells or release of cytokines such as IFN-y into 
the extracellular phase (e.g. into a culture supernatant). 

10 Alternatively, a sample of a possibly infected organ may be contacted with an antibody 
raised against a polypeptide of the invention. The demonstration of the reaction by 
means of methods well-known in the art between the sample and the antibody will be 
indicative of ongoing infection and could be used to monitor treatment effect by reduction 
in responses. It is of course also a possibility to demonstrate the presence of anti- 

15 Mycobacterial antibodies in serum by contacting a serum sample from a subject with at 
least one of the polypeptide fragments of the invention and using well-known methods for 
visualising the reaction between the antibody and antigen such as ELISA, Western blot, 
precipitation assays. 

20 Also a method of determining the presence of virulent Mycobacterium nucleic acids in a 
mammal, including a human being, or in a sample, comprising incubating the sample with 
a nucleic acid sequence of the invention or a nucleic acid sequence complementary 
thereto, and detecting the presence of hybridised nucleic acids resulting from the 
incubation (by using the hybridisation assays which are well-known in the art), is included 

25 in the invention. Such a method of diagnosing TB might involve the use of a composition 
comprising at least a part of a nucleotide sequence as defined above and detecting the 
presence of nucleotide sequences in a sample from the animal or human being to be 
tested which hybridises with the nucleic acid sequence (or a complementary sequence) . 
by the use of PGR techniques. 

30 

The invention also relates to a method of diagnosing infection caused by a virulent 
Mycobacterium in a mammal, including a human being, comprising locally applying 
(patch test) or intradermal^ injecting (Mantoux test) a polypeptide of the invention. These 
tests are both called a delayed hypersensitivity reaction (DTH). A positive skin response 
35 at the location of injection or application is indicative of the mammal including a human 
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being, being infected with a virulent Mycobacterium, and a negative skin response at the 
location of injection or application is indicative of the mammal including a human being 
not having TB. A positive response is a skin reaction having a diameter of at least 5 mm 
larger than background, but larger reactions are preferred, such as at least 1 cm, 1.5 cm, 
5 and at least 2 cm in diameter. A skin reaction is here to mean erythema or induration of 
the skin, as directly measured. The composition used as the skin test reagent can be 
prepared in the same manner as described for the vaccines above. 

In human volunteers, the generation of a significant immune response can alternatively 
10 be defined as the ability of the reagent being tested to stimulate an in vitro recall 
response by peripheral blood cells from at least 30% of PPD positive individuals 
previously vaccinated with that reagent or infected with a virulent Mycobacterium, said 
recall response being defined as proliferation of T cells or the production of cytokine(s) 
which is higher than the responses generated by cells from unimmunised or uninfected 
15 control individuals, with a 95% confidence interval as defined by an appropriate statistical 
analysis such as a Student's two-tailed T test. 

Alternatively, a significant immune response could be detected in vivo by a test such as 
the generation of delayed type hypersensitivity in the skin in response to exposure to the 
immunising reagent, such response being significantly larger (with a 95% confidence 
interval as defined by appropriate statistical analysis such as a Student's two-tailed T 
test) in at least 30% of vaccinated or infected individuals than in placebo-treated or 
uninfected individuals. 

The polypeptides according to the invention may be potential drug targets. Once a 
particular interesting polypeptide has been identified, the biological function of that 
polypeptide may be tested. The polypeptides may constitute receptor molecules or toxins 
which facilitates the infection by the Mycobacterium and if such functionality is blocked, 
the infectivity of the virulent Mycobacterium will be diminished. 

The biological function of particular interesting polypeptides may be tested by studying 
the effect of inhibiting the expression of the polypeptides on the virulence of the virulent 
Mycobacterium. This inhibition may be performed at the gene level such as by blocking 
the expression using antisense nucleic acid, PNA or LNA or by interfering with regulatory 
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sequences or the inhibition may be at the level of translation or post-translational 
processing of the polypeptide. 

Once a particular polypeptide according to the invention is identified as critical for 
5 virulence, an anti-mycobacterial agent might be designed to inhibit the expression of that 
polypeptide. Such anti-mycobacterial agent might be used as a prophylactic or 
therapeutic agent. For instance, antibodies or fragments thereof, such as Fab and (Fab') 2 
fragments, can be prepared against such critical polypeptides by methods known in the 
art and thereafter used as prophylactic or therapeutic agents 

10 

A presently preferred embodiment is an extract of polypeptides obtainable by a method 
comprising the steps of 

a) killing a sample of virulent Mycobacteria] 

b) centrifugating the sample of a) at 2,000g for 40 minutes; 

15 c) resuspending the pellet of b) in PBS and 0.5% Tween 20 and sonicating with 20 
rounds of 90 seconds; 

d) centrifugating the suspension of c) at 5,000g for 30 minutes; 

e) extracting soluble proteins from the cytosol as well as cell wall and cell membrane 
components from the supernatant of d) with 10% SDS; 

20 f) centrifugating the extract of e) at 20,000g for 30 minutes; 

g) precipitating the supernatant of f) with 8 volumes of cold acetone; 

with an adjuvant substance. 

25 In other words, the invention relates to use of an extract of polypeptides with an adjuvant 
substance for the preparation of a composition for the generation or determination of an 
immune response against a virulent Mycobacterium. 

Finally, a monoclonal or polyclonal antibody, which is specifically reacting with a poly- 
30 peptide of the invention in an immuno assay, or a specific binding fragment of said anti- 
body, is also a part of the invention. The production of such polyclonal antibodies 
requires that a suitable animal be immunized with the polypeptide and that these anti- 
bodies are subsequently isolated, suitably by immune affinity chromatography. The 
production of monoclonals can be effected by methods well-known in the art, since the 
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present invention provides for adequate amounts of antigen for both immunization and 
screening of positive hybridomas. 
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Examples 



EXAMPLE 1: Total extraction of proteins from dead M.tuberculosis bacteria. 

1.5 x 10 9 bacteria/ml M.tuberculosis was heat treated at 55°C for 1.5 hours and checked 
for sterility. 10 ml of these heat killed bacteria was centrifuged at 2000 g for 40 min; the 
5 supernatant was discharged and the pellet resuspended in PBS containing 0.5% Tween 
20 and used as the antigen source. The pellet was sonicated with 20 rounds of 90 
seconds and centrifuged 30 min at 5000 g to remove unbroken cells. The supernatant 
containing soluble proteins as well as cell wall and cell membrane components was 
extracted twice with 10% SDS to release proteins inserted in the cell wall and membrane 
10 compartments. After a centrifugation at 20.000 g for 30 min the supernatant was 
precipitated with 8 volume of cold acetone and resuspended in PBS at a protein 
concentration of 5 mg/ml and named: Somatic Proteins Extract (SPE). 

Analysis of protective immune response for tuberculosis after immunisation 
with different M.tuberculosis protein preparations. 

15 The protective efficacy of SPE was evaluated in a vaccination experiment and compared 

to the two vaccines ST-CF and BCG, known to induce protection against TB. 

Five groups of 6-8 weeks old, female C57BI/6J mice (Bomholtgaard, Denmark) were 
immunised subcutaneously at the base of the tail with vaccines of the following 
20 composition: 

Group 1: BCG 

Group 2: 1x 10 7 heat killed M.tuberculosislDQA (250 fig DDA) 
Group 3: 50 fig ST-CF/DDA (250 fig) 
25 Group 4: 50 fig SPE/DDA (250 fig) 

Group 5: Adjuvant control: DDA (250 fig) in NaCI 

The animals were injected with a volume of 0.2 ml. The mice of groups2 ? 3 and 4 were 
boosted twice at two weeks interval. 
30 Four weeks after the last immunisation three mice/group were sacrificed and the spleens 
removed. The immune response induced in the spleen cells was monitored by release of 
IFN-y into the culture supernatants when stimulated in vitro with relevant antigens (Table 
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2). ST-CF and SPE induced a similar immune response while only a very low IFN-* 
release was observed after immunisation with BCG and stimulation with ST-CF. 

Table 2 Recognition of protein preparations after immunisation presented as IFN-y 
5 release (pg/ml) after restimulation. 



Immunogen 


No antigen 


ST-CF 


SPE 


ST-CF 


<200 


6752 ± 591 


8431 ±459 


SPE 


<200 


6621 ± 203 


11079 ±178 


BCG 


<200 


469 ± 32 


ND 



Seven weeks after the final immunisation the mice received a primary infection with 5x10 5 
H37Rv in 0.1 ml iv. and two weeks later the mice were sacrificed and the spleens were 
isolated for bacterial enumeration (figure 2). 
10 BCG induced a high level of protection in the spleen as expected but so did the killed 
H37Rv, ST-CF and SPE and all preparations induced protection at almost the same 
level, with SPE as the most potent of these preparations. 

These data demonstrate that there are components to be found among the somatic 
15 proteins of H37Rv which in an animal model protect against tuberculosis at the same 
level as BCG. 

EXAMPLE 2: Subcellular fractionation of Mycobacterium tuberculosis 

1.5 x 10 9 colony forming units (CFU/ml) of M. tuberculosis H37Rv were inactivated by 
heat-killing at 60°C for 1.5 hour. The heat-killed Mycobacteria was centrifuged at 3,000 x 

20 g for 20 min; the supernatant was discarded and the pellet was resuspended in cold PBS. 
This step was repeated twice. After the final wash, the pellet was resuspended in a 
homogenising buffer consisting of PBS supplemented with 10 mM EDTA and 1 mM of 
phenylmethylsulfonyl fluoride in a ratio of 1 ml buffer per 0.5 g of heat-killed 
Mycobacteria. The sample was sonicated on ice for 15 min (1-min-pulser-on/10-sec- 

25 pulser off) and subsequently lysed three times with a French Pressure Cell at 12,000 
lb/in 2 . The lysate was centrifuged at 27,000 x g for 20 min; the pellet was washed in 
homogenising buffer and recentrifuged. The pooled supernatants contained a mixture of 
cytosol and membrane components, while the pellet represented the crude ceil wall. 
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Preparation of cell wall 

The cell wall pellet, resuspended in homogenising buffer, was added RNase and DNase 
to a final concentration of 1 mg/ml and incubated overnight at 4°C. The cell wall was 
washed twice in homogenising buffer, twice in homogenising buffer saturated with KCI, 
5 and twice with PBS. Soluble proteins were extracted from the cell wall by a 2 hour 
incubation with 2% SDS at 6°C. The Insoluble cell wall core was removed by a 
centrifugation at 27,000 x g for 20 min and the SDS-extraction was repeated. Finally, the 
pooled supernatants were precipitated with 6 volumes of chilled acetone and 
resuspended in PBS. 

10 Preparation of cytosol and membrane: 

To separate the cytosol and the membrane fraction, the pooled supernatants were 

ultracentrifugated at 100,000'x g for 2 hours at 5°C. The cytosol proteins in the 
supernatant were precipitated with acetone and resuspended in PBS. The pellet, 
representing the membrane fraction, was washed in PBS, ultracentrifugated, and finally 
15 resuspended in PBS. 

Triton X-114 extraction of cell wall and membrane: 

To prepare protein fractions largely devoid of lipoarabinomannan, the cell wall and the 
membrane fraction were subjected to extraction with precondensed Triton X-114. Triton 
X-114 was added to the protein sample at a final concentration of 4%. The solution was 

20 mixed on ice for 60 min and centrifuged at 20,000 x g for 15 min at 4°C. The pellet 

containing residual insoluble material was extracted once more (membrane) or twice (cell 
wall), while the supernatant was warmed to 37°C to condense the Triton X-114. After 
centrifugation of the supernatant at 12,000 x g for 15 min, the aqueous phase and 
detergent phase were separated. The aqueous phase and detergent phase were washed 

25 twice with Triton X-1 14 and PBS, respectively. The combined aqueous phases and 
residual insoluble material containing the majority of proteins were pooled, precipitated 
with acetone, and resupended in PBS. 

The specificity of the human T-cell response in TB patients was investigated by 
30 stimulating PBMCs with panels of narrow molecular mass fractions from membrane, cell 
wall, and cytosol obtained by the multi-elution technique described by Andersen et al. 
(1993) J. Immunol. Methods 161:29-39. The technique resulted in 30 sharply defined 
fractions and allowed an identification of immunological- active regions, of potential as 
either diagnostic reagents or as vaccine components. 
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The study demonstrated that multiple targets within the cell wall, membrane, and cytosol 
were recognised by the donors and initiated IFN-y release as well as cellular proliferation 
(unpublished results). The broad cellular response were directed towards both the low 
molecular mass as well as the some of the higher molecular mass fractions. These 
5 experiments suggest the existence of numerous target antigens among the cell wall, 
membrane, and cytosol fractions and it is therefore likely that some of these will have a 
potential as a protective or diagnostic reagent. 

EXAMPLE 3: Identification of proteins from the cytosoiic fraction 

Use of patient sera to identify M. tuberculosis antigens 

10 This example illustrates the identification of antigens from the cytosol fraction by 
screening with serum from M. tuberculosis infected individuals in western blot. The 
reaction with serum was used as an indication that the proteins are recognised 
immunologically. 

15 The cytosol was precipitated with ammonium sulphate at 80% saturation. The non- 
precipitated proteins were removed by centrifugation and precipitated proteins were 
resuspended in 20 mM imidazole pH 7.0. The protein solution was applied to a DEAE 
Sepharose 6B column, equilibrated with 20 mM imidazole pH 7.0. Bound protein was 
eluted from the column using a salt gradient from 0 to 1 M NaCI, in 20 mM imidazole pH 

20 7.0. Fractions collected during elution was analysed on a silver stained 1 0-20% SDS- 
PAGE and on 2 dimensional electrophoresis. 

For use in western blot a pool of serum from 5 TB patients was made. These patients 
ranged from minimal to severe TB. Nitrocellulose membranes were blocked with 

25 phosphate buffer, pH 7.3, containing 0.37 M NaCI and 0.5% Tween-20, for 30 min. The 
serum pool was diluted in phosphate buffer pH 7.3 containing 0.37 M NaCI. The blots 
incubated in serum dilution overnight at room temperature on a shaker. Membranes were 
washed for four times five minutes in the dilution buffer, and incubated with 1 :1 ,000 
diluted peroxidase-labelled swine anti human-IgG (P214, Dako) for 1 hour at room 

30 temperature on a shaker. Blots were then washed for four times 5 min. in the dilution 
buffer and stained with DONS/TMB. 

N-terminal sequencing and amino acid analysis 

Proteins of the fractions containing bands reactive with serum from TB patients in 
Western blot were separated by 2D electrophoresis. Gels were blotted to PVDF 
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membranes and spots subjected to N-terminal sequencing on a Procise sequencer 
(Applied Biosystems). 



The following N-terminal sequences were obtained : 

5 

For TB1 5 :TERTAVLIKPDGIER 

(SEQ ID NO: 39) 

For TB18 :TDTQVTWLTQESHDR 

(SEQ ID NO: 40) 

10 For TB21 :MIDEALFDAEEKMEK 

(SEQ ID NO: 41) 

For TB33 : P L P A D P S T D L S A Y A Q 

(SEQ ID NO: 42) 

For TB38 : M L I S Q R P T L S E D V L T 

15 (SEQ ID NO: 43) 

For TB54 : T G N L V T K N S L T P D V R 

(SEQ ID NO: 44) 

Sequence identity searches 

The N-terminal sequences obtained were used for an identity search using the blast 
20 program of the Sanger M. tuberculosis database : 

http://www.sanger.ac.uk/Projects/M_tuberculosis/blast_server.shtml 

In addition, the GenEMBL database was searched using the BLASTP program (Altschul, 
Stephen F., Warren Gish, Webb Miller, Eugene W. Myers, and David J. Lipman (1990). 
25 Basic local alignment search tool. J. Mol. Biol. 215:403-10 ), to reveal proteins with 
homology to the full amino acid sequences obtained from the Sanger database. 



Thereby, the following information was obtained : 
TB15 

30 For the 15 determined N-terminal amino acids for TB15 a 93% identical sequence was 
found in MTV008.01c. Amino acid 5 of the determined N-terminal sequence (A) is an L in 
the sequence MTV008.01C. 
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Within the open reading frame the translated protein is 136 amino acids long. The N- 
terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with 
the N-terminal Met cleaved off. 

This gives a protein of 136 amino acids, which corresponds to a theoretical molecular 
5 mass of 14 509 Da and a theoretical pi of 5.36. The observed mass in SDS-PAGE is 14 
kDa. 

TB15 has 80% sequence identity in a 139 amino acid overlap to a protein of M. 
smegmatis. It is homologous to putative nucleoside diphosphate kinases from several 
species, e.g. 59% sequence identity to a 151 amino acid protein of Archaeoglobus 
10 fulgidus and 57% sequence identity to a 149 amino acid protein of Bacillus subtilis. 

TB18 

For the 15 determined N-terminal amino acids for TB18 a 100% identical sequence was 
found in MTCY01 7.33c. 
15 Within the open reading frame the translated protein is 164 amino acids long. The N- 
terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with 
the N-terminal Met cleaved off. 

This gives a protein of 164 amino acids, which corresponds to a theoretical molecular 
mass of 17 855 Da and a theoretical pi of 4.81. The observed mass in SDS-PAGE is 20 
20 kDa. 

TB18 has 94% sequence identity, in a 164 amino acid overlap, to a protein from M. 
leprae. In addition, it is homologous to transcription elongation factors from several 
species, e.g. 32% sequence identity in a 114 amino acid overlap, to a protein from 
Zymomonas mobilis. 

25 

TB21 

For the 15 determined N-terminal amino acids for TB21 a 100% identical sequence was 
found in MTCY274. 13c. 

Within the open reading frame the translated protein is 185 amino acids long. The N- 
30 terminal sequence of the protein identified in the cytosol starts at amino acid no 1. 
This corresponds to a theoretical molecular mass of 20 829 Da and a theoretical pi of 
5.81. The observed mass in SDS-PAGE is 22 kDa. 

TB21 has 90% sequence identity in a 185 amino acid overlap to a protein from M. leprae. 
In addition, it is homologous to ribosome recycling factors from several species, e.g. 63% 
35 in a 185 amino acid overlap to a protein from Streptomyces coelicolor. 
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TB33 

For the 15 determined N-terminal amino acids for TB33 a 85% identical sequence was 
found in MTCY71.23. Amino acids 8 and 9 of the determined N-terminal sequence (T and 
5 D) are a P and a T in MTCY71 .23, respectively. 

Within the open reading frame the translated protein is 297 amino acids long. The N- 
terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with 
the N-terminal Met cleaved off. 

This gives a protein of 297 amino acids, which corresponds to a theoretical molecular 
10 mass of 33 323 Da and a theoretical pi of 4.91. The observed mass in SDS-PAGE is 35 
kDa. 

TB33 has 83% sequence identity in a 296 amino acid overlap to a protein from M. leprae. 
In addition, it is homologous to thiosulphate sulfurtransferases (rhodanese) from several 
species, e.g. 48% in a 131 amino acid overlap to rhodanese from Saccharopolyspora 
15 erythraea. 

TB38 

For the 15 determined N-terminal amino acids for TB38 a 100% identical sequence was 
found in MTCY13E12.10c. 

Within the open reading frame the translated protein is 347 amino acids long. The N- 
20 terminal sequence of the protein identified in the cytosol starts at amino acid no 1 . 
This corresponds to a theoretical molecular mass of 37 710 Da and a theoretical pi of 
4.53. The observed mass in SDS-PAGE is 38 kDa. 

TB38 is homologous to DNA-directed RNA polymerase alpha-chains from several 
species, e.g. 79% in a 321 amino acid ov6rlap to a protein from Streptomyces coelicolor. 

25 

TB54 

For the 15 determined N-terminal amino acids for TB54 a 100% identical sequence was 
found in MTCY20B1 1.23c. 

Within the open reading frame the translated protein is 495 amino acids long. The N- 
30 terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with 
the N-terminal Met cleaved off. 

This gives a protein of 495 amino acids, which corresponds to a theoretical molecular 
mass of 54 329 Da and a theoretical pi of 5.00. The observed mass in SDS-PAGE is 60 
kDa. 
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TB54 is homologous to adanosyl homocysteinases from several species, e.g. 73% in a 
90 amino acid overlap to S-adenosyl-L-homocysteine hydrolase from Triticum aestivum. 
It contains a S-adenosyl-L-homocysteine hydrolase signature (PS00739). 

Example 3a: Use of patient sera to identify M. tuberculosis cytosol antigens. 

5 Anion exchange chromatography of the cytosol proteins and Western blot experiments 
with a pool of sera from TB patients were performed as described in Example 3. 

N-terminal sequencing 

Proteins of the fractions containing TB12.5, TB20.6, and TB40.8 were separated by 2D 
electrophoresis. Gels were blotted to PVDF membranes and spots subjected to N- 
10 terminal sequencing on a Procise sequencer (Applied Biosystems). 

The following N-terminal sequences were obtained : 



Sequence identity searches was performed as described in Example 3. 
20 Thereby, the following information was obtained : 
TB12.5 

For the 15 determined N-terminal amino acids of TB12.5 a 93 % identical sequence was 
found in Rv0801. The x in position 11 is a cysteine. 
25 Within the open reading frame the translated protein is 1 15 amino acids long. The N- 
terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with 
the N-terminal Met cleaved off. 

This gives a protein of 1 15 amino acids, which corresponds to a theoretical molecular 
mass of 12 512 Da and a theoretical pi of 4.91. The observed mass in SDS-PAGE is 14 
30 kDa. 

No homology was found to TB12.5. 



For TB12.5 :ALKVEMVTFDXSDPA 



(SEQ ID NO: 80) 
(SEQ ID NO: 81) 
(SEQ ID NO: 82) 



15 For TB20.6 :ADADTTDFDVDAEAP 



For TB40.8 : S KTVL I LG AGVGGLT 



TB20.6 
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For the 15 determined N-terminal amino acids of TB20.6 a 100 % identical sequence was 
found in Rv3920c. 

Within the open reading frame the translated protein is 187 amino acids long. The N- 
terminal sequence of the protein identified in the cytosol starts at amino acid no 1 . 
5 This gives a protein of 187 amino acids, which corresponds to a theoretical molecular 
mass of 20.559 Da and a theoretical pi of 4.14. The observed mass in SDS-PAGE is 24 
kDa. 

TB20.6 has 73 % homology to a 193 amino acid protein of M. leprae. It has 59% 
homology in a 184 amino acid overlap to a Jag-like protein from Streptomyces coelicotor. 

10 

TB40.8 

For the 15 determined N-terminal amino acids of TB40.8 a 100 % identical sequence was 
found in Rv0331. 

Within the open reading frame the translated protein is 388 amino acids long. The N- 
15 terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with 
the N-terminal Met cleaved off. 

This gives a protein of 388 amino acids, which corresponds to a theoretical molecular 
mass of 40 792 Da and a theoretical pi of 5.06. The observed mass in SDS-PAGE is 44 
kDa. 

20 No homology was found to TB40.8. 
Identification of abundant proteins 

As immunity to tuberculosis is not B-cell but T-cell mediated, reactivity with serum from 
TB patients was not the only selection criterion used to identify proteins from the cytosol. 
Further proteins were selected by virtue of their abundance in the cytosol. 

25 The cytosol was precipitated with ammonium sulphate at 80% saturation. The non- 
precipitated proteins were removed by centrifugation and precipitated proteins were 
resuspended in 20 mM imidazole, pH 7.0. The protein solution was applied to a DEAE 
Sepharose 6B column, equilibrated with 20 mM imidazole. Bound protein was eluted from 
the column using a salt gradient from 0 to 1 M NaCI, in 20 mM imidazole. Fractions 

30 collected during elution was analyzed on a silver stained 10-20% SDS-PAGE and on 2 
dimensional electrophoresis. Fractions containing well separated bands were selected for 
2D electrophoresis and blotted to PVDF, after which spots, visualised by staining with 
Coomassie Blue, were selected for N-terminal sequencing. 

35 The following N-terminal sequences were obtained : 
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:M EVKI G ITDSPRELV 
(SEQ ID NO: 45) 
:SAYKTVVVGTDDXSX 
(SEQ ID NO: 46) 
:MEQRAELVVGRALVV 
(SEQ ID NO: 47) 
:ADIDGVTGSAG L(N)P A 
(SEQ ID NO: 48) 
:TYETI LVERDQRVGI 
(SEQ ID NO: 49) 

TB10C 

No sequence identity was found, when searching the Sanger database using the blast 
15 program. However, when the blast program at Swiss-blast was used, a sequence was 
obtained. 

For the 15 determined N-terminal amino acids for TB10C a 93% identical sequence was 
obtained. The first amino acid of the N-terminal sequence (M) is a V in the sequence 
found, corresponding to GTG being used as a start codon, instead of ATG. 
20 Within the open reading frame the translated protein is 90 amino acids. The N-terminal 
sequence of the protein identified in the cytosol starts at amino acid 1 . 
This corresponds to a theoretical molecular mass of 9 433 Da and a theoretical pi of 
4.93. The observed mass in SDS-PAGE is 10 kDa. 

TB15A 

25 For the determined N-terminal sequence of TB15 a 78% identical sequence was found in 
CY01B2.28. The X at position 13 of the determined N-terminal sequence corresponds to 
a G in MTCY01 B2.28 and the X at position 1 5 to a D. 

Within the open reading frame the translated protein is 146 amino acids long. The N- 
terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with 
30 the N-terminal Met cleaved off. 

This gives a protein of 146 amino acids, which corresponds to a theoretical molecular 
mass of 15 313 Da and a theoretical pi of 5.60. The observed mass in SDS-PAGE is 16 
kDa. 



For TB10C 
For TB15A 

5 

For TB17 
For TB24 
10 For TB27B 
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The highest sequence identity, 32% in a 34 amino acid overlap, was found to a 
conserved protein of Methanobacterium thermoautotrophicum. 

TB17 

For the 15 determined N-terminal amino acids forTB17 a 100% identical sequence was 
5 foundinMTV044.12. 

Within the open reading frame the translated protein is 165 amino acids. The N-terminal 
sequence of the protein identified in the cytosol starts at amino acid 1 . 
This gives a protein of 165 aa. Theoretical molecular mass 16 793 Da and a theoretical pi 
of 4.22. The observed mass in SDS-PAGE is 18 kDa. 
10 TB17 is homologous to putative molybdenum cofactor biosynthesis proteins from several 
species, e.g. 34% in a 103 amino acid overlap to moaCB from Synechococcus spp. 

TB24 

For the 15 determined N-terminal amino acids forTB24 a 92% identical sequence was 
found in MTCY07D11.03. The tentative N in position 13 of the determined amino acid 
15 sequence is a Q in MTCY07D11.03, and the A at position 15 is a G. 

Within the open reading frame the translated protein is 216 amino acids long. The N- 
terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with 
the N-terminal Met cleaved off. 

This gives a protein of 216 amino acids, which corresponds to a theoretical molecular 
20 mass of 24 227 Da and a theoretical pi of 4.91 . The observed mass in SDS-PAGE is 28 
kDa. 

TB24 is homologous to a RNA polymerase sigma-E factors from several species, e.g. 
55% in a 72 amino acid overlap to ECF sigma factor RpoE1 from Myxococcus xanthus. 

TB27B 

25 For the 15 determined N-terminal amino acids for TB27B a 100% identical sequence was 
found in MTCY01 7.23c. 

Within the open reading frame the translated protein is 257 amino acids long. The N- 
terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with 
the N-terminal Met cleaved off. 
30 This gives a protein of 257 amino acids, which corresponds to a theoretical molecular 
mass of 27 276 Da and a theoretical pi of 4.82. The observed mass in SDS-PAGE is 28 
kDa. 
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TB27B has 86% sequence identity in a 257 amino acid overlap, to a protein from M. 
leprae. In addition, it is homologous to enoyl-CoA hydratases from several species, e.g. 
66% in a 257 amino acid overlap to a protein from Rhizobium meliloti. 

5 Identification of TB13A : 

One protein spot was selected by its reaction with the monoclonal antibody ST-3 in 
western blot. N-terminal sequencing of the spot on the PVDF membrane corresponding 
to the ST-3 spot yielded the following results : 

10 For TB13A : P VT Q E E 1 1 A G I A E 1 1 

(SEQ ID NO: 50) 

Sequence identity search on the TB13A N-terminal sequence gave the following results. 
15 TB13A 

For the 15 determined N-terminal amino acids for TB13A a 100% identical sequence was 
found in MTCY427.25. 

Within the open reading frame the translated protein is 1 15 amino acids long. The N- 
terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with 
20 the N-terminal Met cleaved off. 

This gives a protein of 115 amino acids, which corresponds to a theoretical molecular 
mass of 12 524 Da and a theoretical pi of 3.87. The observed mass in SDS-PAGE is 10 
kDa. 

TB13A has 94% sequence identity to a 1 15 amino acid protein of M. leprae. It is 
25 homologous to putative acyl carrier proteins from several species, e.g. 59% sequence 
identity to a 78 amino acid protein of Myxococcus xanthus and 56% to a 82 amino acid 
protein from Streptomyces coelicolor. 

identification of TB64 

Biotinylated proteins were purified from the cytosol fraction in the following way: 12 mg of 
30 the cytosol fraction was added to 100 nl of TetraLink Tetrameric Avidin Resin (Promega) 
in PBS, pH 7.4 in an eppendorf tube. After incubation over night at 4°C, centrifugation 
(1000 g for 5 min) was performed and the resin was washed five times with PBS, pH 7.4, 
each time followed by centrifugation and collection of the supernatant. Thereafter, 100 pi 
of 4 times concentrated SDS-PAGE sample buffer (0.08 M Tris-HCI, 8% SDS, 16% 
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glycerol, 24 mM EDTA , pH 8.0) was added to the resin and it was boiled for 20 minutes. 
After centrifugation the supernatant was collected and analysed for the presence of 
biotinylated proteins: The sample was analysed on SDS-PAGE followed by semi-dry 
blotting to nitrocellulose. The nitrocellulose membranes were incubated with alkaline 
5 phosphatase labeled streptavidin (D396, DAKO, Glostrup. Denmark). Nitro-blue 
tetrazolium/5-bromo-4-chloro-3-indolyl phosphate was used as substrate. 

N-terminal sequencing 

The eluate from the TetraLink Tetrameric Avidin Resin was loaded on a precast 10-20% 
Tricine SDS-PAGE gel (Novex, San Diego, USA). After electrophoresis the gel was 

10 blotted to Problott PVDF membrane (Applied Biosystems. Foster City, CA) by semidry 
electroblotting in 10 mM CAPS, 10% methanol. pH 1 1 . The PVDF membrane was 
stained with 0.1% Coomassie R-250 in 40% methanol, 1% acetid acid, and destained in 
50% methanol. A band of 10 kDa which was identified as a biotinylated protein as 
described above was excised and subjected to N-terminal sequence analysis by 

15 automated Edman degradation using a Precise 494 sequencer (Applied Biosystems) as 
described by the manufacturer. 

The following sequence was obtained: 

VIRRKPKPRXR (SEQ ID NO: 57) 

20 Submission of this sequence to the Sanger Centre M. tuberculosis blast server .dentrfied 
the open reading frame Rv3285 (91% identity in 1 1 amino acids) encoding a protein of 
600 amino acids. The determined sequence showed identity to amino acids 511 to 521 
suggesting that the identified peptide is a C-terminal fragment of the protein. As 
expected, the pattern for biotinylation of a lysine was identified in the C-terminal part of 
25 the protein: GDLWVLEAMKMENPVTA (residues 556-573, PROSITE pattern PS00188). 

EXAMPLE 4: Identification of proteins from the cell wall. 

Identification of TB11B, TB16, TB16A, TB32, TB32A, and TB51. 

Proteins contained in the cell wall fraction were separated by 2-D electrophoresis. A 
30 sample containing 120 mg protein was subjected to isoelectric focusing in a pH gradient 
from 4 to 7. The second dimension separation (SDS-PAGE) was carried out in a 10-20% 
acrylamide gradient. After blotting onto a PVDF membrane, proteins could be visualised 
by Coomassie blue staining. 
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N-terminal sequencing. 

The relevant spots were excised from the PVDF membrane and subjected to N-terminal 
sequencing using a Procise sequencer (Applied Biosystems). The following N-terminal 
sequences were obtained: 

5 

TB1 1 B:PWKINAIEVPAGA ( SEQ ID N0: 51 ) 

TB16:ADKTTQTIYIDADPG (SEQ ID NO: 52) 

TB 1 6A: PVLS KTVEVTADAAS ( SEQ ID N0: 53 > 

TB32:SGNSSLGIIVGIDD ( SEQ ID N0: W 

10 TB32A:AEVLVLVEHAEGALK (SEQ ID NO: 55) 

TB51 :MKSTVEQLSPTRVRI ( SEQ ID N0: 56 > 

N-terminal sequence identity searching and identification of the corresponding 
genes. 

The N-terminal amino acid sequence from each of the proteins identified was used for a 
15 sequence identity search using the tblastn program at NCBI: 
http://www.ncbi.nlm.nih.gov/cgi-bin/BLAST/nph-blast? Jform=0 



The following information was obtained: 
20 TB11B: 

The 14 aa N-terminal sequence was found to be 100% identical to a sequence found on 
cosmid SCY06F7. 

The identity is found within an open reading frame of 105 amino acids lenght 
corresponding to a theoretical molecular mass of 1 1 1 85 Da and a pi of 6.18. The 
25 apparent molecular mass in an SDS-PAGE gel is 12 kDa. 

The amino acid sequence shows some low level similarity to oxygenases and 
hypothetical proteins. 

TB16: 

The 15 aa N-terminal sequence was found to be 100% identical to a sequence found 
30 within the Mycobacterium tuberculosis sequence MTV021 . 

The identity is found within an open reading frame of 144 amino acids length 
corresponding to a theoretical molecular mass of 16294 Da and a pi of 4.64. The 
apparent molecular mass in an SDS-PAGE gel is 17 kDa. 
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The amino acid sequence shows some similarity to other hypothetical Mycobacterial 
proteins. 

TB16A: 

The 15 aa N-terminal sequence was found to be 100% identical to a sequence found on 
5 cosmidl28. 

The identity is found within an open reading frame of 146 amino acids length 
corresponding to a theoretical molecular mass of 16 060 Da and a pi of 4.44. The 
apparent molecular mass in an SDS-PAGE gel is 14 kDa. 

TB32: 

10 The 14 aa N-terminal sequence was found to be 100% identical to a sequence found 

within the Mycobacterium tuberculosis sequence MTCY1A10. 

The identity is found within an open reading frame of 297 amino acids length 

corresponding to a theoretical molecular mass of 31654 Da and a pi of 5.55. The 

apparent molecular mass in an SDS-PAGE gel is 33 kDa. 
15 The amino acid sequence shows some similarity to other hypothetical Mycobacterial 

proteins. 

TB32A: 

The 15 aa N-terminal sequence was found to be 100% identical to a sequence found 
within the Mycobacterium tuberculosis sequence MTV012. 
20 The identity is found within an open reading frame of 318 amino acids length 
corresponding to a theoretical molecular mass of 31694 Da and a pi of 4.61. The 
apparent molecular mass in an SDS-PAGE gel is 32 kDa. 

The amino acid sequence reveals high sequence identity to the fixB gene product from 
several organisms. Probable electron transfer flavoprotein alpha subunit for various 
25 dehydrogenases. Equivalent to Mycobacterium leprae FixB. 

TB51: 

The 15 aa N-terminal sequence was found to be 100% identical to a sequence found 
within the Mycobacterium tuberculosis sequence MTV008. 
The identity is found within an open reading frame of 466 amino acids length 
30 corresponding to a theoretical molecular mass of 50587 Da and a pi of 4.3. The apparent 
molecular mass in an SDS-PAGE gel is 56 kDa. 

The amino acid sequence shows similarities to trigger factor from several organisms. 
Possible chaperone protein. 
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EXAMPLE 5: Cloning of the genes encoding TB10C, TB13A, TB17, TB11B, 
TB16, TB16A, TB32, TB51 

The genes encoding TB10C, TB13A, TB17, TB11B, TB16, TB16A, TB32, TB51 were all 
cloned into the E coli expression vector pMCT3, by PCR amplification with gene specific 
5 primers. 

Each PCR reaction contained 10 ng of M tuberculosis chromosomal DNA in 1x low salt 
Taq+ buffer (Stratagene) supplemented with 250 |uM of each of the four nucleotides 
(Boehringer Mannheim), 0.5 mg/ml BSA (IgG technology), 1% DMSO (Merck), 5 pmoles 
of each primer, and 0.5 unit Taq+ DNA polymerase (Stratagene) in 10 reaction volume. 
10 Reactions were initially heated to 94°C for 25 sec. and run for 30 cycles according to the 
following program; 94°C for 10 sec, 55°C for 10 sec, and 72°C for 90 sec, using 
thermocycler equipment from Idaho Technology. 

The PCR fragment was ligated with TA cloning vector pCR® 2.1 (Invitrogen) and 
transformed into E. coli. Plasmid DNA was thereafter prepared from clones harbouring 

15 the desired fragment, digested with suitable restriction enzymes and subcloned into the 
expression vector pMCT3 in frame with 6 histidine residues which are added to the N- 
terminal of the expressed proteins. The resulting clones were hereafter sequenced by 
cycle sequencing using the Dye Terminator system in combination with an automated gel 
reader (model 373A; Applied Biosystems) according to the instructions provided. Both 

20 strands of the DNA were sequenced. 

Expression and metal affinity purification of recombinant proteins was undertaken 
essentially as described by the manufacturers. For each protein, 1 I LB-media containing 
100 ng/ml ampicillin, was inoculated with 10 ml of an overnight culture of XL1-Blue cells 

25 harbouring recombinant pMCT3 plasmids. Cultures were shaken at 37°C until they 

reached a density of OD 60 o= 0.4 - 0.6. IPTG was hereafter added to a final concentration 
of 1 mM and the cultures were further incubated 4-16 hours. Cells were harvested, 
resuspended in 1x sonication buffer + 8 M urea and sonicated 5 x 30 sec with 30 sec. 
pausing between the pulses. 

30 After centrifugation, the lysate was applied to a column containing 10 ml of resuspended 
Talon resin (Clontec, Palo Alto, USA). The column was washed and eluted as described 
by the manufacturers. 

After elution, all fractions (1.5 ml each) were subjected to analysis by SDS-PAGE using 
the Mighty Small (Hoefer Scientific Instruments, USA) system and the protein 
35 concentrations were estimated at OD 28 onm. Fractions containing recombinant protein 
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were pooled and dialysed against 3 M urea in 10 mM Tris-HCI, pH 8.5. The dialysed 
protein was further purified by FPLC (Pharmacia, Sweden) using 1 ml HiTrap columns 
(Pharmacia, Sweden) eluted with a linear salt gradient from 0 - 1 M NaCI. Fractions were 
analysed by SDS-PAGE and protein concentrations were estimated at OD 28 onm- Fractions 
5 containing protein were pooled and dialysed against 25 mM Hepes buffer, pH 8.5. 
Finally, the protein concentration and the LPS content were determined by the BCA 
(Pierce, Holland) and LAL (Endosafe, Charleston, USA) tests, respectively. 

For cloning of the individual proteins, the following gene specific primers were used : 

10 

TB10C : Primers used for cloning of TB10C : 

TB10C-F . CTG AGA TCT GTG GAG GTC AAG ATC GGT (SEQ ID NO: 58) 

TB1 OC-R : CTC CCA TGG CTAC TTA CCC GCT CGT AGC AAC (SEQ ID NO: 59) 

15 TB10C-F and TB10C-R create BG/II and Ncol sites, respectively, used for the cloning in 
pMCT3. 



TB13A : Primers used for cloning of TB13A : 
20 TB13A-F : CTG AGA TCT CCT GTC ACT CAG GAA GAA (SEQ ID NO: 60) 

TB13A-R : CTC CCA TGG GAA ACC GCC ATT AGC GGT (SEQ ID NO: 61) 

TB13A-F and TB13A-R create BG/II and Ncol sites, respectively, used for the cloning in 
pMCT3. 

25 

TB17 : Primers used for cloning of TB17 : 

TB17-F : CCC AAG CTT ATG GAA CAG CGT GCG GAG (SEQ ID NO: 62) 

TB17-R : CTC CCA TGG CGA CAC TCG ATC CGG ATT (SEQ ID NO: 63) 

30 

TB17-F and TB17-R create BG/II and Ncol sites, respectively, used for the cloning in 
pMCT3. 



35 



TB1 1 B : Primers used for cloning of TB1 1 B : 
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TB1 1 B-F : CTG AGA TCT ATG CCA GTG GTG AAG ATC (SEQ ID NO: 64) 

TB1 1 B-R : CTC CCA TGG TTA TGC AGT CTT GCC GGT (SEQ ID NO: 65) 

TB1 1 B-F and TB11B-R create BG/II and Ncol sites, respectively, used for the cloning in 
5 pMCT3. 

TB16 : Primers used for cloning OF TB16 : 

TB16-F : CTG AGA TCT GCG GAC AAG ACG ACA CAG (SEQ ID NO: 66) 

TB16-R : CTC CCA TGG TAC CGG AAT CAC TCA GCC (SEQ ID NO: 67) 

10 

TB16-F and TB16-R create BG/II and Ncol sites, respectively, used for the cloning in 
pMCT3. 

15 TB16A : Primers used for cloning of TB16A : 

TB1 6A-F : CTG AGA TCT CCA GTT TTG AGC AAG ACC (SEQ ID NO: 68) 

TB16A-R : CTC CCA TGG GCA CAT GCC TTA GCT GGC (SEQ ID NO: 69) 

TB16A-F and TB16A-R create BG/II and Ncol sites, respectively, used for the cloning in 
20 pMCT3. 



TB32 : Primers used for cloning of TB32 : 

TB32-F : CTG AGA TCT ATG TCA TCG GGC AAT TCA (SEQ ID NO: 70) 

25 TB32-R : CTC CCA TGG CTAC CTA AGT CAG CGA CTC GCG (SEQ ID NO: 71) 

TB32-F and TB32-R create BG/II and Ncol sites, respectively, used for the cloning in 
pMCT3. 

30 

TB51 : Primers used for cloning of TB51 : 

TB51-F : CTG AGA TCT GTG AAG AGC ACC GTC GAG (SEQ ID NO: 72) 

TB51-R : CTC CCA TGG GTC ATA CGG TCA CGT TGT (SEQ ID NO: 73) 
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TB51-F and TB51-R create BG/II and Ncol sites, respectively, used for the cloning in 
pMCT3. 

TB1 5A: Primers used for cloning of TB1 5A: 

5 

TB1 5A-F: CTG CCA TGG CTA GGT GGT GTG CAC GAT C (SEQ ID NO: 89) 

TB1 5A-R: CTG AAG CTT ATG AGC GCC TAT AAG ACC (SEQ ID NO: 90) 

TB15-F and TB15-R create Ncol and Hindlll sites, respectively, used for the cloning in 
10 pMCT3. 

TB21 : Primers used for cloning of TB21 : 

r 

TB21-F: CTG AGA TCT ATG ATT GAT GAGGCT CTC (SEQ ID NO: 91) 

15 TB21-R: CTC CCA TGG AGC GGC CGC TAG ACC TCC (SEQ ID NO: 92) 

TB21-F and TB21-R create Bglll and Ncol sites, respectively, used for the cloning in 
pMCT3. 

20 TB24: Primers used for cloning of TB24: 

TB24-F: G GCTGAG ACTC ATG GCC GAC ATC GAT GGT G (SEQ ID NO: 93) 

TB24-R: CGTACCATGG TCA TGA CGA CAC CCC CTC GTG (SEQ ID NO: 94) 

25 TB24-F and TB24-R create Bglll and Ncol sites, respectively, used for the cloning in 
pMCT3. 

TB32A: Primers used for cloning of TB32A: 

30 TB32A-F: GGCTGAGACTC ATG GCT GAA GTA CTG GTG C (SEQ ID NO: 95) 

TB32A-R: CGTACCATGGCTA GCC GGC GAC CGC CGG TTC (SEQ ID NO: 96) 

TB32A-F and TB32A-R create Bglll and Ncol sites, respectively, used for the cloning in 
pMCT3. 

35 



WO 00/21983 



52 



PCT/DK99/00538 



TB14: Primers used for cloning of TB14: 

TB14-F: 5 -GTG ACC GAA CGG ACT CTG GT-3' (SEQ ID NO: 97) 

TB14-R: 5'-CTA GGC GCC GGG AAA CCA GAG-3' (SEQ ID NO: 98) 

5 

TB18: Primers used for cloning of TB18: 



TB18-F: 5'-ATG ACG GAT ACT CAA GTC ACC TG-3' (SEQ ID NO: 99) 

TB18-R: 5'-GGA GTG GTA CGG CTC GGC GC-3' (SEQ ID NO: 100) 

10 

TB27: Primers used for cloning of TB27: 

TB27-F: 5'-ATG ACG TAC GAA ACC ATC CT-3' (SEQ ID NO: 101) 

TB27-R: 5 -TCA TCG GTG GGT GAA CTG GGG-3' (SEQ ID NO: 102) 

15 

TB33: Primers used for cloning of TB33: 

TB33-F: 5-ATG CCG CTT CCC GCA GAC CCT AG-3' (SEQ ID NO: 103) 

TB33-R: 5*-TAC GAC GGG TAC CAC TCC TGG-3' (SEQ ID NO: 104) 



20 



TB38: Primers used for cloning of TB38: 



TB38-F: 5 -ATG CTG ATC TCA CAG CGC CCC A-3' (SEQ ID NO: 105) 

TB38-R: 5 -AAG CTG TTC GGT TTC GGC GTA G-3' (SEQ ID NO: 106) 

25 

TB54: Primers used for cloning of TB54: 

TB54-F: 5' -ATG ACC GGA AAT TTG GTG AC-3' (SEQ ID NO: 1 07) 

TB54-R: 5 -TCA GTA GCG GTA GTG GTC CGG-3' (SEQ ID NO: 1 08) 

30 

TB14,TB18,TB27,TB33,TB38 and TB54 will be cloned in ex-pressions vector pBAD- 
TOPO (Invitrogen). 



35 



Example 5a: Cloning of the genes encoding TB12.5, TB20.6, and TB40.8 

The genes encoding TB12.5, TB20.6, and TB40.8 were all cloned into the E. coli 
expression vector pMCT3 as described in Example 5. 
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For cloning of the individual genes, the following gene specific primers were used: 

TB12.5: Primers used for cloning of TB12.5: 
5 TB1 2.5-F: CTG AGA TCT ATG GCA CTC AAG GTA GAG (SEQ ID NO: 83) 

TB1 2.5-R: CTC CCA TGG TTA TTG ACC CGC CAC GCA (SEQ ID NO: 84) 

TB1 2.5-F and TB1 2.5-R create Bg/ll and Nco\ sites, respectively, used for the cloning in 
pMCT3. 

10 

TB20.6: Primers used for cloning of TB20.6: 

TB20.6-F: CTG AGA TCT ATG GCC GAC GCT GAC ACC (SEQ ID NO: 85) 

TB20.6-R: CTC CCA TGG CTA GTC GCG GAG CAC AAC (SEQ ID NO: 86) 

15 TB20.6-F and TB20.6-R create Bg/ll and Nco\ sites, respectively, used for the cloning in 
pMCT3. 

TB40.8: Primers used for cloning of TB40.8: 

TB40.8-F: CTG AGA TCT ATG AGC AAG ACG GTT CTC (SEQ ID NO: 87) 

20 TB40.8-R: CTC CCA TGG TCA CGT CTT CCA GCG GGT (SEQ ID NO: 88) 

TB40.8-F and TB40.8-R create Bg/ll and A/col sites, respectively, used for the cloning in 
pMCT3. 

25 Expression/purification of recombinant proteins was performed as described in Example 

5. 

EXAMPLE 6: Evaluation of immunological activity of identified somatic 
proteins. 

Each of the proteins identified in either the cell wall, cytosol or the cell membrane derived 
30 from M. tuberculosis will be evaluated for the immunological recognition in M. tuberculosis 
infected animals or in TB patients. 
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IFN^y induction in the mouse model of TB infection 

The recognition of an antigen by IFN-y producing T cells in M. tuberculosis infected 
animals or in TB patients is presently believed to be the most relevant correlate of 
protective immunity. 

5 We will therefore evaluate the ability of the polypeptides of the invention to induce an 
IFN-y production in mice of four different haplotypes during a primary infection: 8-12 
weeks old female mice C57BL/6j (H-2 b ), CBA/J (H-2 k ), DBA.2 (H-2 d ) and A.SW (H-2 S ) 
mice (Bomholtgaard, Ry, Denmark) will be infected i.v. via the lateral tail vein with an 
inoculum of 5 x 10 4 M. tuberculosis suspended in PBS in a vol. of 0.1 ml. 14 days 
10 postinfection the animals will be sacrificed and spleen cells isolated and tested for 
proliferation and the IFN-y release in response to stimulation with the recombinantly 
produced proteins. 

As a specific model we will analyse the recognition of the purified polypeptides of the 
15 invention the mouse model of memory immunity to TB: A group of efficiently protected 
mice will be generated by infecting 8-12 weeks old female C57BI/6j mice with 5 x 10 4 
M. tuberculosis i.v. After 30 days of infection the mice will be subjected to 60 days of 
antibiotic treatment with isoniazid (Merck and Co., Rahway, NJ) and rifabutin (Farmatalia 
Carlo Erba, Milano, Italy) then left for 200-240 days to ensure the establishment of resting 
20 long-term memory immunity. Such memory immune mice are very efficient protected 
against a secondary infection (Orme; Andersen, Boom 1993, J.lnfect.Dis. 167: 1481- 
1497). Long lasting immunity in this model is mediated by a population of highly reactive 
CD4 cells recruited to the site of infection and triggered to produce large amounts of IFN- 
y in response to M.tuberculosis antigens. 
25 This model will be used to identify single antigens recognised by protectiveT cells. 
Memory immune mice will be reinfected with 1 x 10 6 M.tuberculosis i.v and splenic 
lymphocytes harvested at day 4-6 of reinfection and proliferation and the amount of IFN-y 
produced in response to any of the recombinantly produced proteins will be evaluated. 

IFN^y induction in humans during infection with virulent Mycobacteria. 

30 IFN-y is currently believed to be the best marker of protective immunity in humans. In 
patients with limited tuberculosis, high levels of IFN-y can be induced, in contrast to 
patients with severe TB who often respond with low levels of IFN-y (Boesen et al (1995), 
Human T-cell response to secreted antigen fractions of M.tuberculosis. Infection and 
Immunity 63(4):1491-1497). Furthermore, IFN-y release has been shown to correlate 
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inversely with the severity of disease as determined by X-ray findings (Sodhi A, et al 
(1997) Clinical correlates of IFN-gamma production in patients with Tuberculosis, Clinical 
Infectious disease. 25; 617-620). Healthy exposed contacts of sputum positive TB 
patients also produce very high levels of IFN-y in response to mycobacterial antigens 
5 (unpublished, manus in prep) indicative of early, subclinical infection. Together these 
findings indicate that those individuals who are relatively protected (i.e. minimal TB 
patients) respond with high levels of IFN-y. The ability of the polypeptides to induce IFN-y 
release in cultures of PBMC or whole blood from 20 PPD responsive patients with 
microscopy or culture proven TB (0-6 month after diagnosis), exposed household 

10 contacts, or BCG vaccinated individuals from different geographical regions will be 

evaluated. Evaluation of donors from different geographical regions will enable us to take 
into account the influence of i.e. exposure to virulent Mycobacterium or NTM (Non- 
Tuberculous Mycobacteria) and different genetic background. The most important 
selection criteria for vaccine candidates are the polypeptides which are recognised by 

15 >30% of the donors with a level of IFN y >30% of that induced by a crude antigen 
preparation like ST-CF, PPD and SPE. 

Cultures will be established with 1 to 2 x 10 5 PBMC in 200^1 in microtiter plates (Nunc, 
Roskilde, Denmark) or with 1 ml of serum or plasma stimulated with the identified 
polypeptide and the IFN-y release measured by ELISA. 
20 Polypeptides of the invention frequently recognised will be preferred. 

The use of polypeptides as diagnostic reagents: 

A polypeptide has diagnostic potential in humans when it is inducing significantly higher 
responses in patients with microscopy or culture positive tuberculosis compared to PPD 
positive or PPD negative individuals with no known history of TB infection or exposure to 

25 M.tuberculosis but who may or may not have received a prior BCG vaccination, have 
been exposed to non-tuberculous mycobacteria(NTM), or be actively infected with 
M.avium. To identify polypeptides capable of discriminating between the above 
mentioned groups, the level of response and the frequency of positive responders to the 
polypeptide is compared. By positive responders are meant i) in vitro IFN-y release by 

30 PBMC or whole blood stimulated with the polypeptide of at least 3-500 pg/ml above 
background or another cut off relating to the specific test kit used, ii) reactivity by human 
serum or plasma from TB patients with the polypeptide using conventional antibody 
ELISA/Western blot or iii) in vivo delayed type hypersensitivity response to the 
polypeptide which is at least 5 mm higher than the response induced by a control 

35 material. 
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The diagnostic potential of polypeptides will initially be evaluated in 10 individuals with TB 
infection and 1 0 individuals with no known exposure to virulent Mycobacteria. High 
specificity, >80% .will be the most important selection criteria for these polypeptides and 
5 a sensitivity >80% is desirable but sensitivity >30% is acceptable as combinations of 
several specific antigens may be preferred in a cocktail of diagnostic reagent recognised 
by different individuals. 

Skin test reaction in TB infected guinea pigs 

To identify polypeptides as antigens with the potential as TB diagnostic reagents the 
10 ability of the proteins to induce a skin test response will be evaluated in the guinea pig 
model where groups of guinea pigs have been infected with either M. tuberculosis or 
M. avium or vaccinated with BCG. 

To evaluate the response in M. tuberculosis infected guinea pigs, female outbred guinea 
15 pigs will be infected via an ear vein with 1 x 10 4 CFU of M. tuberculosis H37Rv in 0.2 ml of 
PBS or aerosol infected (in an exposure chamber of a Middlebrook Aerosol Generation 
device) with 1x 10 5 CFU/ml of M. tuberculosis Erdman given rise to 10-15 granulomas per 
animal in the lung. After 4 weeks skin test will be performed with the polypeptides diluted 
in 0.1 ml of PBS and 24 hours after the injection reaction diameter is measured. 

20 

To evaluate the response in M.avium infected guinea pigs, female outbred guinea pigs 
will be infected intradermally with 2 x 10 6 CFU of a clinical isolate of M.avium (Atyp.1443; 
Statens Serum Institut, Denmark). Skin test are performed 4 weeks after with the 
polypeptides diluted in 0.1 ml of PBS and 24 hours after the injection reaction diameter is 
25 measured. 

To evaluate the response in BCG vaccinated guinea pigs, female outbred guinea pigs will 
be sensitized intradermally with 2 x 10 6 CFU of BCG (BCG Danish 1331 ; Statens Serum 
Institut). Skin test are performed 4 weeks after with the polypeptides diluted in 0.1 ml of 
30 PBS and 24 hours after the injection reaction diameter is measured. 

If a polypeptide induces a significant reaction in animal infected with M. tuberculosis but 
not in BCG vaccinated guinea pigs this polypeptide may have a potential as a diagnostic 
reagent to differentiate between BCG vaccinated and M. tuberculosis infected individuals, 
35 which will hereafter be evaluated in the human population. 
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If a polypeptide induces a reaction in M. tuberculosis infected guinea pigs but not in 
guinea pigs infected with M. avium , this polypeptide may have a potential as a diagnostic 
reagent with respect to differentiate between an individual infected with M [tuberculosis 
and an individual infected with Mycobacteria not belonging to the tuberculosis complex. 
5 The polypeptide may also have a potential as a diagnostic reagent to differentiate 
between a M.avium and a M. tuberculosis infected individual. 

Induction of protective immunity by the recombinant proteins in the mice model. 
The recombinant polypeptides will be evaluated as immunological compositions in mice. 
Female C57BL/6j mice of 6-8 weeks old (Bomholtgaard, Denmark) will be immunised 

10 subcutaneously at the base of the tail with the recombinantly produced polypeptides with 
DDA as adjuvant. The mice will be vaccinated with a volume of 0.2 ml in total of three 
times with two weeks interval between each immunisation. One week after last 
immunisation the mice will be bled and the blood cells isolated. The immune response 
induced will be monitored by release of IFN-y into the culture supernatant when 

15 stimulated in vitro with the homologous proteins. 

6 weeks after the last immunisation the mice will be aerosol challenged with 5.5 ml of 5 x 
10 6 viable M.tuberculosisfr\)\. After 6 weeks of infection the mice will be killed and the 
number of viable bacteria in lung and spleen determined by plating serial 3-fold dilution of 
organ homogenates on 7H11 plates. Colonies will be counted after 2-3 weeks of 

20 incubation and the levels of protection induced by each of the single polypeptide will be 
determined. 

Example 6a: Interferon^ induction in human TB patients and BCG 
vaccinated 

Human donors: PBMC were obtained from healthy BCG vaccinated donors with no 
25 known exposure to M. tuberculosis and from patients with culture or microscopy proven 
infection with TB. Blood samples were drawn from the TB patients 0-6 months after 
diagnosis of tuberculosis, and 20 months to 40 years after BCG vaccination. 

Lymphocyte preparations and cell culture: PBMC were freshly isolated by gradient 
30 centrifugation of heparinized blood on Lymphoprep (Nycomed, Oslo, Norway) and stored 
in liquid nitrogene until use. The cells were resuspended in complete RPMI 1640 medium 
(Gibco, Grand Island, N.Y.) supplemented with 1% penicillin/streptomycin (Gibco BRL, 
Life Technologies), 1% non-essential-amino acids (FLOW, ICN Biomedicals, CA, USA), 
and 10% normal human ABO serum (NHS) from the local blood bank. The number and 
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the viability of the cells were determined by Nigrosin staining. Cultures were established 
with 1.25 x 10 s PBMCs in 100 p\ in microtitre plates (Nunc, Roskilde, Denmark) and 
stimulated with ST-CF (Sfxg/ml), TB13A, TB15A, TB17, TB18, TB33, TB1 1B, TB16A, 
TB16, TB32 t and TB51 in a final concentration of 10 fig/ml. No antigen and 
5 phytohaemagglutinin (PHA) were used as negative and positive control, respectively. 
Supernatants for the detection of cytokines were harvested after 5 days of culture, 
pooled, and stored at -80°C until used. 

Cytokine analysis: Interferon-y (IFN-y) was detected with a standard sandwich ELISA 
10 technique using a commercially available pair of monoclonal antibodies (Endogen) and 
used according to the manufacturers instruction. Recombinant IFN-y (Endogen) was used 
as a standard. All data are means of duplicate wells and the variation between wells did 
not exceed 10 % of the mean. Cytokine levels below 50 pg/ml were considered negative. 
Responses of 10 individual donors are shown in TABLE 3. 

15 

As shown in Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9, Table 10, 
Table 11, and Table 12 a marked release of IFN-y is observed after stimulation with some 
of the recombinant proteins. For 50% of the donors, stimulation with TB18, TB32, and 
TB51 give rise to high IFN-y responses (> 1,000 pg/ml). Less than 1/3 of the donors 

20 recognised TB1 5A and TB1 1 B at this level. Between 30 and 70% of the donors show 
intermediate IFN-y response (> 500 pg/ml) when stimulated with TB17 and TB16A 
whereas only limited response was obtained by TB13A, TB33, and TB16. However, 
TB13A, TB33 and TB16 may still be of immunological importance and meet some of the 
other properties of the present invention. E.g. as demonstrated for TB33 which is 

25 recognised by a pool of sera from human TB-patients. 
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Table 3 Stimulation of PBMCs from 6 healthy BCG vaccinated and 4 TB patients with 
recombinant TB13A. Responses to ST-CF and PHA are shown for comparison. Results 
are given as pg IFN-y/ml. 



5 BCG vaccinated control donors, no known TB exposure 



Donor 


No ag 


PHA 


ST-CF 


TB13A 




(1 M9/ml) 


(5 ng/ml) 


(10 fig/ml) 


1 


12 


11572 


10860 


41 


2 


0 


14257 


11536 


0 


3 


7 


13270 


8844 


493 


4 


0 


13193 


2828 


0 


5 


4 


14239 


14275 


332 


6 


0 


16278 


12623 


0 


TB patients 


Donor 


No ag 


PHA 


ST-CF 


TB13A 




(1 f»g/ml) 


(5 ng/ml) 


(10ng/ml) 


1 


0 


9914 


3297 


0 


2 


51 


10058 


6489 


0 


3 - 


0 


10587 


9155 


0 


4 


0 


9458 


5236 


18 



10 
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Table 4 Stimulation of PBMCs from 6 healthy BCG vaccinated and 5 TB patients with 
recombinant TB15A. Responses to ST-CF and PHA are shown for comparison. Results 
are given as pg IFN-y/ml. 



5 BCG vaccinated control donors, no known TB exposure 



Donor 


No ag 


PHA 

(1 HQ/ml) 


ST-CF 
(5 ng/ml) 


TB15A 
(10 M g/ml) 


1 


0 


18860 


3733 


1478 


2 


0 


16218 


2856 


0 


3 


94 


18427 


13998 


0 


4 


0 


17815 


4255 


0 


5 


0 


15981 


10830 


441 


6 


81 


16961 


11165 


8009 


TB patients 


Donor 


No ag 


PHA 

(1 ng/mi) 


ST-CF 
(5 ng/ml) 


TB15A 
(10ng/ml) 


1 


231 


18854 


6443 


57 


2 


0 


17213 


2196 


0 


3 


0 


17880 


1049 


0 


4 


0 


17777 


2865 


0 


5 


0 


17487 


5321 


0 



Table 5 Stimulation of PBMCs from 6 healthy BCG vaccinated with recombinant TB17. 
10 Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN- 
y/ml 



BCG vaccinated control donors, no known TB exposure 



Donor 


No ag 


PHA 

(1 ng/mi) 


ST-CF 
(5 ng/ml) 


TB17 
(10ng/ml) 


1 


33 


16696 


7304 


66 


2 • 


102 


16878 


6427 


50 


3 


49 


12161 


11055 


0 


4 


0 


12949 


2284 


73 


5 


81 


12129 


6669 


1029 


6 


0 


12706 


11762 


656 
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Table 6 Stimulation of PBMCs from 3 healthy BCG vaccinated and 3 TB patients with 
recombinant TB18. Responses to ST-CF and PHA are shown for comparison. Results 
are given as pg IFN-y/ml 



5 BCG vaccinated control donors, no known TB exposure 



Donor 


No ag 


PHA 


ST-CF 


TB18 




(1 ng/mi) 


(5 ng/ml) 


(10 pg/ml) 


1 


82 


20862 


15759 


842 


2 


7 


17785 


10088 


1855 


3 


912 


16198 


11350 


6838 


TB patients 


Donor 


No ag 


PHA(1 


ST-CF(5 


TB18(10 






ng/ml) 


ng/ml) 


Mg/mi) 


1 


60 


12301 


11057 


265 


2 


7 


10390 


6123 


167 


3 


34 


11678 


8136 


1629 
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Table 7 Stimulation of PBMCs from 5 healthy BCG vaccinated and 6 TB patients with 
recombinant TB33. Responses to ST-CF and PHA are shown for comparison. Results 
are given as pg IFN-y/ml. 



5 BCG vaccinated control donors, no known TB exposure 



Donor 


No ag 


PHA 
(1 ng/ml) 


ST-CF 
(5 ng/ml) 


TB33 
(10ng/ml) 


1 


589 


10068 


4426 


721 


2 


1953 


10817 


6316 


662 


3 


702 


11837 


1640 


0 


4 


605 


9463 


2694 


0 


5 


2471 


7990 


5979 


0 


TB patients 


Donor 


No ag 


PHA 
d H9Vml) 


ST-CF 
(5 ng/ml) 


TB33 
(10 ng/ml) 


1 


0 


3647 


812 


0 


2 


0 


12266 


920 


0 


3 


0 


12899 


4388 


0 


4 


0 


10233 


7989 


0 
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Table 8 Stimulation of PBMCs from 3 healthy BCG vaccinated and 3 TB patients with 
recombinant TB1 1B. Responses to ST-CF and PHA are shown for comparison. Results 
are given as pg IFN-y/ml. 

5 BCG vaccinated control donors, no known TB exposure 

Donor Noag PHA ST-CF TB11B 

(1ng/ml) (5ng/ml) (lung/ml) 

13682 9067 1379 

13705 10169 2092 

13231 7740 0 



TB patients 



Donor 


Noag 


PHA 


ST-CF 


TB11B 




(1 ng/ml) 


(5 ng/m!) 


(10 ng/ml) 


1 


0 


13285 


8025 


0 


2 


0 


13157 


3945 


0 


3 


0 


13207 


4485 


0 



1 

2 
3 



0 
0 
0 



10 Table 9. Stimulation of PBMCs from 2 healthy BCG vaccinated and 5 TB patients with 
recombinant TB16A. Responses to ST-CF and PHA are shown for comparison. Results 
are given as pg IFN-y/ml. 



BCG vaccinated control donors, no known TB exposure 



Donor 


No ag 


PHA 
(1 M g/ml) 


ST-CF 
(5 ng/ml) 


TB16A 
(10ng/ml) 


1 


0 


12816 


1831 


645 


2 


0 


14530 


10293 


1404 


TB patients 


Donor 


No ag 


PHA 
(1 M g/ml) 


ST-CF 
(5 ng/ml) 


TB16A 
(10ng/ml) 


1 


0 


11606 


5460 


42 


2 


0 


11836 


5837 


977 


3 


388 


12353 


8401 


958 


4 


0 


9587 


3169 


499 


5 


43 


10820 


4869 


593 
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Table 10. Stimulation of PBMCs from 6 healthy BCG vaccinated with recombinant TB16. 
Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN- 
y/ml in BCG vaccinated control donors, no known TB exposure. 



Donor 


No ag 


PHA 


ST-CF 


TB16 






(1 M g/mi) 


(5 |ig/ml) 


(10 M g/ml) 


1 


33 


16696 


7304 


0 


2 


102 


16878 


6427 


292 


3 


49 


12161 


11055 


514 


4 


0 


12949 


2284 


24 


5 


81 


12129 


6669 


58 


6 


0 


12706 


11762 


36 



Table 11. Stimulation of PBMCs from 3 healthy BCG vaccinated and 3 TB patients with 
recombinant TB32. Responses to ST-CF and PHA are shown for comparison. Results 
are given as pg IFN-y/ml. 



10 BCG vaccinated control donors, no known TB exposure 



Donor 


No ag 


PHA 


ST-CF 


TB32 






(1 M9/ml) 


(5 ng/ml) 


(10 fig/ml) 


1 


82 


20862 


15759 


1614 


2 


7 


17785 


10088 


3385 


3 


912 


16198 


11350 


9863 



TB patients 



Donor 


No ag 


PHA 


ST-CF 


TB32 






(1 ng/ml) 


(5 ng/ml) 


(10ng/ml) 


1 


60 


12301 


11057 


562 


2 


7 


10390 


6123 


206 


3 


34 


11678 


8136 


83 
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Table 12. Stimulation of PBMCs from 6 healthy BCG vaccinated with recombinant TB51. 
Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN- 
y/ml. 



5 BCG vaccinated control donors, no known TB exposure 



Donor 


No ag 


PHA 


ST-CF 


TB51 




(1 ng/mi) 


(5 fig/ml) 


(10ng/ml) 


1 


33 


16696 


7304 


596 


2 


102 


16878 


6427 


1155 


3 


49 


12161 


11055 


2247 


4 


0 


12949 


2284 


777 


5 


81 


12129 


6669 


140 


6 


0 


12706 


11762 


1123 
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Figure legends: 
Figure 1: 

Long term protection against TB can be induced by immunisation with dead 
M.tuberculosis. 

5 Mice received either: three immunisations with 1x10 7 CFU of dead M.tuberculosis H37Rv 
(squares); three immunisations with 50 ^g of ST-CF (triangles); one immunisation with 5 
x 10 4 CFU of live M.tuberculosis H37Rv (circle) and was hereafter cleared for the 
infection by administration of isoniazid in the drinking water. At 3, 6 and 12 month after 
the last immunisation the mice received an infection with M.tuberculosis H37Rv and two 
10 weeks later the bacterial load and the resistance against TB in the spleens were 
determined. 

Figure 2: 

Mice received three immunisations with 50^g of either of the three vaccines: heat killed 
H37Rv, SPE or ST-CF or received a vaccination with BCG. Two weeks after a primary 
15 infection the bacterial load in the spleen was used to determined the resistance against 
TB. 
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Claims 

1 . A substantially pure polypeptide, which has a sequence identity of at least 80% to an 
amino acid sequence selected from the group consisting of SEQ ID NOs: 34, 2, 4, 6, 8, 
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 75, 77 and 79 or a subsequence of 
5 at least 6 amino acids thereof, wherein the polypeptide or the subsequence thereof has 
at least one of the following properties: 

i) the polypeptide induces an in vitro recall response determined by a release of IFN-y of 
at least 1,500 pg/ml from reactivated memory T-lymphocytes withdrawn from a mouse 

10 within 4 days after the mouse has been rechallenged with 1 x 10 6 virulent Mycobacteria, 
the induction being performed by the addition of the polypeptide to a suspension 
comprising about 2 x 10 5 cells isolated from the spleen of said mouse, the addition of the 
polypeptide resulting in a concentration of the polypeptide of not more than 20 fig per ml 
suspension, the release of IFN-y being assessable by determination of IFN-y in 

15 supernatant harvested 3 days after the addition of the polypeptide to the suspension, 

ii) the polypeptide induces an in vitro response during primary infection with virulent 
Mycobacteria, determined by release of IFN-y of at least 1 ,500 pg/ml from T-lymphocytes 
withdrawn from a mouse within 28 days after the mouse has been infected with 5 x 10 4 

20 virulent Mycobacteria, the induction being performed by the addition of the polypeptide to 
a suspension comprising about 2 x 10 5 cells isolated from the spleen, the addition of the 
polypeptide resulting in a concentration of not more than 20 jig per ml suspension, the 
release of IFN-y being assessable by determination of IFN-y in supernatant harvested 3 
days after the addition of the polypeptide to the suspension, 

25 

iii) the polypeptide induces a protective immunity determined by vaccinating an animal 
with the polypeptide and an adjuvant in a total of three times with two weeks interval 
starting at 6-8 weeks of age, 6 weeks after the last vaccination challenging with 5 x 10 6 
virulent Mycobacterial^ by aerosol and determining a significant decrease in the number 

30 of bacteria recoverable from the spleen 6 weeks after the animal has been challenged, 
compared to the number recovered from the same organ in an animal given placebo 
treatment, 

iv) the polypeptide induces in vitro recall response determined by release of IFN-y of at 
35 least 1,000 pg/ml from Peripheral Blood Mononuclear Cells (PBMC) withdrawn from TB 
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patients or PPD positive individuals 0-6 months after diagnosis, the induction being 
performed by the addition of the polypeptide to a suspension comprising about 1.0 to 2.5 
x 10 5 PBMC, the addition of the polypeptide resulting in a concentration of not more than 
20 |ig per ml suspension, the release of IFN-y being assessable by determination of IFN-y 
5 in supernatant harvested 5 days after the addition of the polypeptide to the suspension, 

v) the polypeptide induces a specific antibody response in a TB patient as determined by 
an ELISA technique or a western blot when the whole blood is diluted 1 :20 in PBS and 
stimulated with the polypeptide in a concentration of not more than 20 ng/ml. 

10 

vi) the polypeptide induces a positive in vitro response determined by release of IFN-y of 
at least 500 pg/ml from Peripheral Blood Mononuclear Cells (PBMC) withdrawn from an 
individual who is clinically or subclinical^ infected with a virulent Mycobacterium, the 
induction being performed by the addition of the polypeptide to a suspension comprising 

15 about 1 .0 to 2.5 x 10 5 PBMC, the addition of the polypeptide resulting in a concentration 
of not more than 20 ^g per ml suspension, the release of IFN-y being assessable by 
determination of IFN-y in supernatant harvested 5 days after the addition of the 
polypeptide to the suspension, and does not induce such an IFN-y release in an 
individual not infected with a virulent Mycobacterium, 

20 

vii) the polypeptide induces a positive in vitro response determined by release of IFN-y of 
at least 500 pg/ml from Peripheral Blood Mononuclear Cells (PBMC) withdrawn from an 
individual clinically or subclinical^ infected with a virulent Mycobacterium, the induction 
being performed by the addition of the polypeptide to a suspension comprising about 1.0 

25 to 2.5 x 10 5 PBMC, the addition of the polypeptide resulting in a concentration of not 
more than 20 ng per ml suspension, the release of IFN-y being assessable by determi- 
nation of IFN-y in supernatant harvested 5 days after the addition of the polypeptide to 
the suspension, and does not induce such a IFN-y release in an individual who has a 
cleared infection with a virulent Mycobacterium, 

30 

viii) the polypeptide induces a positive DTH response determined by intradermal injection 
of at most 100 jig of the polypeptide to an individual who is clinically or subclinical^ 
infected with a virulent Mycobacterium, a positive response having a diameter of at least 
10 mm 72 hours after the injection, and does not induce such a response in an individual 

35 not infected with a virulent Mycobacterium, 
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ix) the polypeptide induces a positive DTH response determined by intradermal injection 
of at most 100 |ig of the polypeptide to an individual who is clinically or subclinical^ 
infected with a virulent Mycobacterium, a positive response having a diameter of at least 
5 10 mm 72 hours after the injection, and does not induce such a response in an individual 
who has a cleared infection with a virulent Mycobacterium. 

2. A substantially pure polypeptide which comprises an amino acid sequence selected 
from the group consisting of SEQ ID NOs: 34, 2, 4, 6, 8 t 10, 12, 14, 16, 18, 20, 22, 24, 

10 26, 28, 30, 32, 36, 38, 75, 77 and 79. 

3. A polypeptide according to any of claims 1 or 2, which comprises an amino acid 
sequence which has a sequence identity of at least 80% to an amino acid sequence 
selected from the group consisting of SEQ ID NOs: 34, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 

15 22, 24, 26, 28, 30, 32, 36, 38, 75, 77 and 79 and/or is a subsequence thereof 

4. A purified or non-naturally occurring polypeptide as defined in any of claims 1-3 which 
comprises a T cell epitope. 

20 5. A purified or non-naturally occurring polypeptide as defined in any of claims 1-4 which 
comprises a B cell epitope. 

6. A polypeptide according to any of claims 1-5, wherein the polypeptide is encodable by 
a nucleic acid sequence, which sequence 

25 

1) is the DNA sequence selected from the group consisting of SEQ ID NOs: 33, 1, 3, 5, 7, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 35, 37, 74, 76, and 78 or an analogue of said 
sequence which hybridises with any of the DNA sequences shown in SEQ ID NOs: 33, 1, 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 35, 37, 74, 76, or 78 or a DNA 

30 sequence complementary thereto, or a specific part thereof, preferably under stringent 
hybridisation conditions, and/or 

2) encodes a polypeptide, the amino acid sequence of which has a 80% sequence 
identity with an amino acid sequence selected from the group consisting of SEQ ID NOs: 

35 34, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 75, 77 and 79 and/or 
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3) constitutes a subsequence of any of the above mentioned DNA sequences, and/or 

4) constitutes a subsequence of any of the above mentioned polypeptide sequences. 

5 

7. A polypeptide as defined in any of claims 1-6 for use in medicine. 

8. Use of a polypeptide as defined in any of claims 1-6 for the manufacture of a 
diagnostic reagent for the diagnosis of an infection with a virulent Mycobacterium. 

10 

9. Use of a polypeptide as defined in any of claims 1-6 for the manufacture of a 
composition for induction of a protective immune response in a mammal against infection 
with a virulent Mycobacterium. 

15 10. A composition comprising a polypeptide as defined in any of claims 1-7, further 
comprising at least one other polypeptide derived from a virulent Mycobacterium. 

1 1 . A composition comprising, as the effective component, a micro-organism, wherein at 
least one copy of a DNA sequence comprising a DNA sequence encoding a polypeptide 

20 as defined in any of claims 1-6 has been incorporated into the genome of the micro- 
organism in a manner allowing the micro-organism to express and optionally secrete the 
polypeptide. 

12. A diagnostic reagent for diagnosing an infection with a virulent Mycobacterium 

25 comprising a polypeptide as defined in any of claims 1-7, optionally in combination with a 
pharmaceutical^ acceptable carrier or vehicle. 

13. A diagnostic reagent according to claim 12 for differentiating an individual who is 
clinically or subclinical^ infected with a virulent Mycobacterium from an individual not 

30 infected with virulent Mycobacterium. 



35 



14. A diagnostic reagent according to any of claims 12 for differentiating an individual 
who is clinically or subclinical^ infected with a virulent Mycobacterium from an individual 
who has a cleared infection with a virulent Mycobacterium. 
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15. A diagnostic reagent according to any of claims 12 for diagnosing an infection with 
Mycobacterium tuberculosis. 

16. An extract of polypeptides obtainable by a method comprising the steps of 
5 a) killing a sample of virulent Mycobacteria; 

b) centrifugating the sample of a) at 2,000g for 40 minutes; 

c) resuspending the pellet of b) in PBS and 0.5% Tween 20 and sonicating with 20 
rounds of 90 seconds; 

d) centrifugating the suspension of c) at 5,000g for 30 minutes; 

10 e) extracting soluble proteins from the cytosol as well as cell wall and cell membrane 
components from the supernatant of d) with 10% SDS; 

f) centrifugating the extract of e) at 20,000g for 30 minutes; 

g) precipitating the supernatant of f) with 8 volumes of cold acetone; 
with an adjuvant substance. 

15 

17. Use of an extract of polypeptides with an adjuvant substance according to claim 16 
for the preparation of a composition for the generation of an immune response against a 
virulent Mycobacterium. 

20 18. A method of screening for inhibition of the infectivity of a virulent Mycobacterium 
belonging to the tuberculosis complex, said method comprising 

a) inhibiting the expression of one or more of the polypeptides according to the invention, 
and 

25 

b) observing the effect, if any, on the infectivity of the bacteria. 

19. A method according to claim 18 wherein the expression is inhibited by blocking the 
transcription of the polypeptides or by interfering with regulatory sequences. 

30 

20. A method according to claim 19, wherein the inhibition is at the level of translation or 
post-translational processing of the polypeptides or by direct interaction with the 
polypeptides. 
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21 . A method of using the polypeptides having a significant effect on the infectivity of a 
virulent Mycobacterium as tested in any of claims 18-20 for designing a prophylactic or 
therapeutic agent. 

5 22. A nucleotide sequence which is a nucleotide sequence selected from the group 
consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 
37, 74, 76 and 78 or an analogue of said sequence which hybridises with any of the 
nucleotide sequences shown in SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 
27, 29, 31 , 33, 35, 37, 74, 76 or 78 or a nucleotide sequence complementary thereto, or a 
1 0 specific part or subsequence thereof, preferably under stringent hybridisation conditions. 

23. A monoclonal or polyclonal antibody, which is specifically reacting with a polypeptide 
according to any of claims 1-7 in an immuno assay, or a specific binding fragment of said 
antibody. 
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Fig. 1 

Kinetics of protective efficacy of different 
mycobacterial preparations 
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Fig. 2 

Protective efficacy of various 
bacterial preparations in the spleen 
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SEQUENCE LISTING 
<110> Statens Serum Institute 

<120> TB vaccine and diagnostic based antigens 
from the M . tuberculosis cell 



<130> 21868PC1 
<160> 108 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 273 
<212> DNA 

<213> M. Tuberculosis 

<220> 

<221> CDS 

<222> (1) . - . (270) 



<400> 1 

gtg gag gtc aag ate ggt ate acg gac agt ccg cgc gag ctg gtg ttc 
Val Glu Val Lys lie Gly He Thr Asp Ser Pro Arg Glu Leu Val Phe 
5 10 15 



1 



tec agt gcg cag acg ccc agt gag gta gaa gaa etc gtc age aae gcg 
Ser Ser Ala Gin Thr Pro Ser Glu Val Glu Glu Leu Val Ser Asn Ala 

25 30 



20 



ctg cgc gac gac tct ggt ttg ctg acc ctg acc gac gag egg ggc cgt 
Leu A?g Asp Asp Ser Gly Leu Leu Thr Leu Thr Asp Glu Arg Gly Arg 



35 



40 45 



cgc ttc eta att cac acc gee agg ate gcc tat gtc gag ate ggt gte 
Arg Phe Leu He His Thr Ala Arg He Ala Tyr Val Glu He Gly Val 



50 



55 60 



gca gac gcc cgc egg gtg ggc ttc ggc gte ggg gtg gac gee gca get 
JUa Asp Ala Arg Arg Val Gly Phe Gly Val Gly Val Asp Ala Ala Ala 
65 70 75 8U 

ggg tec gcc gga aag gtt get acg age ggg taa 
Glv Ser Ala Gly Lys Val Ala Thr Ser Gly 
85 90 



48 



96 



144 



192 



240 



273 



<210> 2 
<211> 90 
<212> PRT 

<213> M. Tuberculosis 



<400> 2 

Met Glu Val Lys He Gly He Thr 

1 5 
Ser Ser Ala Gin Thr Pro Ser Glu 
20 

Leu Arg Asp Asp Ser Gly Leu Leu 



Asp Ser Pro Arg Glu Leu Val Phe 

10 15 
Val Glu Glu Leu Val Ser Asn Ala 
25 30 
Thr Leu Thr Asp Glu Arg Gly Arg 
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35 40 45 

Arg Phe Leu He His Thr Ala Arg He Ala Tyr Val Glu He Gly Val 

50 55 60 

Ala Asp Ala Arg Arg Val Gly Phe Gly Val Gly Val Asp Ala Ala Ala 
65 * 70 75 80 

Gly Ser Ala Gly Lys Val Ala Thr Ser Gly 
85 90 

<210> 3 
<211> 348 
<212> DNA 

<213> M. Tuberculosis 

<220> 

<221> CDS 

<222> (1) . . . (345) 

<400> 3 

gtg cct gtc act cag gaa gaa ate att gec ggt ate gec gag ate ate 
Val Pro Val Thr Gin Glu Glu He He Ala Gly lie Ala Glu He He 
15 10 15 

gaa gag gta ace ggt ate gag ccg tec gag ate ace ccg gag aag teg 
Glu Glu Val Thr Gly He Glu Pro Ser Glu He Thr Pro Glu Lys Ser 
20 25 30 

ttc gtc gac gac ctg gac ate gac teg ctg teg atg gtc gag ate gee 
Phe Val Asp Asp Leu Asp He Asp Ser Leu Ser Met Val Glu He Ala 
35 " 40 45 

gtg cag ace gag gac aag tac ggc gtc aag ate ccc gac gag gac etc 
Val Gin Thr Glu Asp Lys Tyr Gly Val Lys He Pro Asp Glu Asp Leu 
50 55 60 

gee ggt ctg cgt acc gtc ggt gac gtt gtc gee tac ate cag aag etc 
Ala Gly Leu Arg Thr Val Gly Asp Val Val Ala Tyr He Gin Lys Leu 
65 1 70 75 80 

gag gaa gaa aac ccg gag gcg get cag gcg ttg cgc gcg aag att gag 
Glu Glu Glu Asn Pro Glu Ala Ala Gin Ala Leu Arg Ala Lys He Glu 
85 90 95 

teg gag aac ccc gat gec gtt gee aac gtt cag gcg agg ctt gag gec 
Ser Glu Asn Pro Asp Ala Val Ala Asn Val Gin Ala Arg Leu Glu Ala 
100 105 110 

gag tec aag tga 
Glu Ser Lys 
115 



48 



96 



144 



192 



240 



288 



336 



348 



<210> 4 
<211> 115 
<212> PRT 

<213> M. Tuberculosis 
<400> 4 

Met Pro Val Thr Gin Glu Glu He He Ala Gly He Ala Glu He He 

15 10 15 

Glu Glu Val Thr Gly He Glu Pro Ser Glu He Thr Pro Glu Lys Ser 
20 25 30 
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Phe Val Asp Asp Leu Asp He Asp Ser Leu Ser Met Val Glu He Ala 

35 40 45 

Val Gin Thr Glu Asp Lys Tyr Gly Val Lys lie Pro Asp Glu Asp Leu 

50 55 60 

Ala Gly Leu Arg Thr Val Gly Asp Val Val Ala Tyr lie Gin Lys Leu 

65 J 70 75 , 

Glu Glu Glu Asn Pro Glu Ala Ala Gin Ala Leu Arg Ala Lys lie Glu 

85 90 95 

Ser Glu Asn Pro Asp Ala Val Ala Asn Val Gin Ala Arg Leu Glu Ala 
100 105 110 

Glu Ser Lys 
115 

<210> 5 
<211> 411 
<212> DNA 

<213> M. Tuberculosis 

<220> 

<221> CDS 

<222> (1) • • • (408) 

<400> 5 A a 

gtg acc gaa egg act ctg gta ctg ate aag ccg gat ggc ate gaa agg 4 8 

Val Thr Glu Arg Thr Leu Val Leu He Lys Pro Asp Gly He Glu Arg 
1 5 10 15 

cag ctg ate ggc gag ate ate age cgc ate gag cgc aaa ggc etc acc 
Gin Leu He Gly Glu He He Ser Arg He Glu Arg Lys Gly Leu Thr 
20 25 30 

ate get gcg ctg cag etc agg acc gtc age gcg gag ttg gee age cag 
lie Ala Ala Leu Gin Leu Arg Thr Val Ser Ala Glu Leu Ala Ser Gin 
35 40 45 

cac tac gee gaa cat gaa ggc aaa eca ttc ttt gga teg ttg ctg gag 
His Tyr Ala Glu His Glu Gly Lys Pro Phe Phe Gly Ser Leu Leu Glu 
50 55 60 

ttc ate acg teg ggt ccg gtg gta gcg gcg ate gtg gag gga acc cga 
Phe lie Thr Ser Gly Pro Val Val Ala Ala He Val Glu Gly Thr Arg 
65 70 75 80 

acc ate ocq gcg gtt cgc caa etc gee ggc ggc acc gac ccg gtg cag 
HI He III III Val Arg Gin Leu Ala Gly Gly Thr Asp Pro Val Gin 
85 90 95 

acq gcg gcg ccc ggc aca ate egg ggc gac ttc get eta gag acg cag 
Ala Ala Ala Pro Gly Thr lie Arg Gly Asp Phe Ala Leu Glu Thr Gin 
100 105 HO 

ttc aac ctg gtg cac ggg tct gat teg gee gaa tec gcg cag cgc gaa 
Phe Asn Leu Val His Gly Ser Asp Ser Ala Glu Ser Ala Gin Arg Glu 
115 120 125 

ate gcg etc tgg ttt ccc ggc gee tag 
He Ala Leu Trp Phe Pro Gly Ala 
130 135 



96 



144 



192 



240 



288 



336 



384 



411 



<210> 6 
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<211> 136 
<212> PRT 

<213> M. Tuberculosis 
<400> 6 



Met 


Thr 


Glu 


Arg 


Thr 


Leu 


Val Leu He Lys 


Pro Asp Gly 


He 


Glu 


Arg 


1 






5 




10 






15 




Gin 


Leu 


He 


Gly 


Glu 


lie 


He Ser Arg He 


Glu Arg Lys 


Gly 


Leu 


Thr 








20 






25 




30 






He 


Ala 


Ala 


Leu 


Gin 


Leu 


Arg Thr Val Ser 


Ala Glu Leu 


Ala 


Ser 


Gin 






35 








40 


45 








His 


Tyr 


Ala 


Glu 


His 


Glu 


Gly Lys Pro Phe 


Phe Gly Ser 


Leu 


Leu 


Glu 




50 










55 


60 








Phe 


He 


Thr 


Ser 


Gly 


Pro 


Val Val Ala Ala 


He Val Glu Gly Thr Arg 


65 








70 




75 






80 


Ala 


He 


Ala 


Ala 


Val 


Arg 


Gin Leu Ala Gly Gly Thr Asp 


Pro 


Val 


Gin 








85 




90 






95 




Ala 


Ala 


Ala 


Pro 


Gly 


Thr 


He Arg Gly Asp 


Phe Ala Leu 


Glu 


Thr 


Gin 






100 






105 




110 






Phe 


Asn 


Leu 


Val 


His 


Gly 


Ser Asp Ser Ala 


Glu Ser Ala 


Gin 


Arg 


Glu 






115 








120 


125 








He 


Ala 


Leu Trp 


Phe 


Pro 


Gly Ala 











130 135 



<210> 7 
<211> 441 
<212> DNA 

<213> M. Tuberculosis 



<220> 

<221> CDS 

<222> (1) . . . (438) 



<400> 7 

atg age gec tat aag acc gtg gtg gta gga acc gac ggt teg gac teg 

Met Ser Ala Tyr Lys Thr Val Val Val Gly Thr Asp Gly Ser Asp Ser 

15 10 15 

teg atg cga gcg gta gat cgc get gee cag ate gee ggc gca gac gec 
Ser Met Arg Ala Val Asp Arg Ala Ala Gin He Ala Gly Ala Asp Ala 
20 25 30 

aag ttg ate ate gec teg gca tac eta cct cag cac gag gac get cgc 
Lys Leu He He Ala Ser Ala Tyr Leu Pro Gin His Glu Asp Ala Arg 
35 40 45 

gee gec gac att ctg aag gac gaa age tac aag gtg acg ggc acc gee 
Ala Ala Asp He Leu Lys Asp Glu Ser Tyr Lys Val Thr Gly Thr Ala 
50 55 60 

ccg ate tac gag ate ttg cac gac gee aag gaa cga gcg cac aac gee 
Pro He Tyr Glu He Leu His Asp Ala Lys Glu Arg Ala His Asn Ala 
65 ^ 70 75 80 

ggt gcg aaa aac gtc gag gaa egg ccg ate gtc ggc gee ccg gtc gac 
Gly Ala Lys Asn Val Glu Glu Arg Pro He Val Gly Ala Pro Val Asp 
85 90 95 

gcg ttg gtg aac ctg gec gat gag gag aag gcg gac ctg ctg gtc gtc 
Ala Leu Val Asn Leu Ala Asp Glu Glu Lys Ala Asp Leu Leu Val Val 
100 105 HO 



48 



96 



144 



192 



240 



288 



336 
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SS 2 2 S5 2 2 S £ S 2 2 2 2 2 2 2 

J 115 120 i " 

2 E 2 f a ? 2 2 2 K 2 2 2 2 2 2 2 2 

130 135 

acc acc tag 
Thr Thr 
145 



<210> 8 
<211> 146 
<212> PRT 

<213> M. Tuberculosis 

Met SerTla Tyr Lys Thr Val Val Val Gly Thr Asp Gly Ser Asp Ser 

10 



Ser Met Arg Ala Val Asp Arg Ala Ala Gin lie Ala Gly Ala Asp Ala 



25 30 



Lys Leu lie lie Ala Ser Ala Tyr Leu Pro Gin His Glu Asp Ala Arg 

Ma Ala Asp He Leu Lys Asp Glu Ser Tyr Lys Val Thr Gly Thr Ala 

55 

Pro lie Tyr Glu He Leu His Asp Ala Lys Glu Arg Ala His Asn Ala 

Gly Ala Lys Asn Val Glu Glu Arg Pro lie Val Gly Ala Pro Val Asp 

Ala Leu Val Asn Leu Ala Asp Glu Glu Lys Ala Asp Leu Leu Val Val 

100 1^5 
Gly Asn Val Gly Leu Ser Thr lie Ala Gly Arg Leu Leu Gly Ser Val 

ns I 20 
Pro Ala Asn Val Ser Arg Arg Ala Lys Val Asp Val Leu lie Val Hrs 



130 
Thr Thr 
145 



135 I 40 



at-t aac oat cqc acq gcg cac ggc gat gaa gac cac age ggg ccg ctt 
Val Asp Asp Arg ?S Ala His Gly Asp Glu Asp His Ser Gly Pro Leu 
20 25 

2 2 B 2 2 2 2 K S 2 2 2 2 2 2 2 



384 



432 



441 



<210> 9 
<211> 498 
<212> DNA 

<213> M. Tuberculosis 

<220> 

<221> CDS 

<222> (1) . . ■ (495) 

2 E 2 S 2 2 2 2 2 2 2 2 2 2 2 2 " 

5 10 .13 



96 



144 
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35 40 45 

gcg gtg teg gec gac gag gtc gag ate cga aat gcg ctg aac aca gcg 192 

Ala Val Ser Ala Asp Glu Val Glu lie Arg Asn Ala Leu Asn Thr Ala 

50 55 60 

gtg ate ggc ggg gtg gac ctg gtg gtg teg gtc ggc ggg acc ggg gtg 240 

Val He Gly Gly Val Asp Leu Val Val Ser Val Gly Gly Thr Gly Val 
65 ' 70 75 80 



acg cct cgc gat gtc acc ccg gaa gee acc cgc gac att ctg gac cgc 
Thr Pro Arg Asp Val Thr Pro Glu Ala Thr Arg Asp He Leu Asp Arg 
8 5 90 95 

gag ate etc ggt ate gee gag gee ate cgc gcg tec ggg ctg tec gcg 
Glu He Leu Gly He Ala Glu Ala He Arg Ala Ser Gly Leu Ser Ala 
100 105 110 

gga ate gtc gac gee ggg ttg teg cgc ggc ctg gcg ggt gtc tec ggc 
Gly lie Val Asp Ala Gly Leu Ser Arg Gly Leu Ala Gly Val Ser Gly 
115 ' 120 125 

age acg ctg gtg gtc aac etc gcg ggt teg cgt tat gcg gtg cgc gat 
Ser Thr Leu Val Val Asn Leu Ala Gly Ser Arg Tyr Ala Val Arg Asp 
130 135 140 

gga atg gcg acg ctg aat ccg eta gcg gca cag ate ate ggg cag ttg 
Gly Met Ala Thr Leu Asn Pro Leu Ala Ala Gin He He Gly Gin Leu 
145 150 155 160 

teg age ttg gag ate tga 
Ser Ser Leu Glu He 
165 



288 



336 



384 



432 



480 



498 



<210> 10 
<211> 165 
<212> PRT 

<213> M. Tuberculosis 



<400> 10 



Met Glu 


Gin 


Arg 


Ala 


Glu 


Leu 


Val Val 


Gly Arg 


Ala Leu Val Val 


Val 


1 




5 








10 




15 




Val Asp 


Asp 


Arg 


Thr 


Ala 


His 


Gly Asp 


Glu 


Asp 


His Ser Gly Pro 


Leu 




20 








25 






30 




Val Thr 


Glu 


Leu 


Leu 


Thr 


Glu 


Ala Gly 


Phe 


Val 


Val Asp Gly Val 


Val 




35 










40 






45 




Ala Val 


Ser 


Ala 


Asp 


Glu 


Val 


Glu He 


Arg 


Asn 


Ala Leu Asn Thr 


Ala 


50 










55 








60 




Val He 


Gly 


Gly 


Val 


Asp 


Leu 


Val Val 


Ser 


Val 


Gly Gly Thr Gly 


Val 


65 




70 








75 




80 


Thr Pro 


Arg 


Asp 


Val 


Thr 


Pro 


Glu Ala 


Thr 


Arg 


Asp He Leu Asp 


Arg 




85 








90 




95 




Glu He 


Leu 


Gly 


He 


Ala 


Glu 


Ala He 


Arg 


Ala 


Ser Gly Leu Ser 


Ala 






100 








105 






110 




Gly He 


Val 


Asp 


Ala 


Gly 


Leu 


Ser Arg Gly Leu Ala Gly Val Ser 


Gly 


115 










120 






125 




Ser Thr 


Leu 


Val 


Val 


Asn 


Leu 


Ala Gly 


Ser Arg 


Tyr Ala Val Arg 


Asp 


130 










135 








140 




Gly Met 


Ala 


Thr 


Leu 


Asn 


Pro 


Leu Ala 


Ala 


Gin 


He He Gly Gin 


Leu 


145 








150 








155 




160 
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Ser Ser Leu Glu He 
165 

<210> 11 
<211> 495 
<212> DNA 

<213> M. Tuberculosis 



<220> 
<221> CDS 
<222> (1) . 



. (492) 



<400> 11 

atq acq gat act caa gtc acc tgg ttg acc caa gag tea cat gac cga 

Met Thr Asp Thr Gin Val Thr Trp Leu Thr Gin Glu Ser His Asp Arg 
1 5 10 15 

etc aaa gca gag etc gac cag ctg att gcg aat cgc ccg gtc ate gee 
Leu Lys Ala Glu Leu Asp Gin Leu He Ala Asn Arg Pro Val lie Ala 
20 25 30 

gec gaa ate aac gac cgc cgc gaa gaa ggc gac ctg cgc gag aac ggc 
Ala Glu lie Asn Asp Arg Arg Glu Glu Gly Asp Leu Arg Glu Asn Gly 
35 40 45 

gga tac cac gee gec cgc gag gag cag ggc cag cag gag gee cgc att 
Gly Tyr His Ala Ala Arg Glu Glu Gin Gly Gin Gin Glu Ala Arg lie 
50 . 55 60 

cqc cag ctg cag gac ttg etc age aac gca aag gtt ggc. gag gca ccc 
Arg Gin Leu Gin Asp Leu Leu Ser Asn Ala Lys Val Gly Glu Ala Pro 
65 "?0 75 80 

aag caa tec ggc gtc gca tta ccc ggt tct gtg gtc aag gtg tac tac 
Lys Gin Ser Gly Val Ala Leu Pro Gly Ser Val Val Lys Val Tyr Tyr 
85 90 95 

aac ggc gac aag teg gac age gaa acg ttc etc ate gee acc cgc cag 
Asn Gly Asp Lys Ser Asp Ser Glu Thr Phe Leu lie Ala Thr Arg Gin 
100 IO 5 110 

gag ggc gtc age gac ggc aag etc gag gtc tac teg ccg aat tea ccg 
Glu Gly Val Ser Asp Gly Lys Leu Glu Val Tyr Ser Pro Asn Ser Pro 
115 ' 120 125 

etc ggt ggg gee ctg ate gac gee aag gtc ggc gag acc cgc age tac 
Leu Gly Gly Ala Leu lie Asp Ala Lys Val Gly Glu Thr Arg Ser Tyr 
130 ' 135 140 

acg gtg ccc aac ggc age acc gtg teg gtg acc eta gtc age gec gag 
Thr Val Pro Asn Gly Ser Thr Val Ser Val Thr Leu Val Ser Ala Glu 
145 150 155 > 60 

ccg tac cac tec tag 
Pro Tyr His Ser 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



495 



<210> 12 
<211> 164 
<212> PRT 
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<213> M. Tuberculosis 



<400> 12 



Met 


Thr 


Asp 


Thr 


Gin 


val 


inr 


Trp 


Leu 


Thr 


Gin 


PI n 

V3lU 


Cq v- 


ni s 




Arg 


1 






c 
D 










10 










1 R 




Leu 


Lys 


Ala 


Glu 


Leu 


Asp 


bin 


Leu 


T 1 a. 

lie 


Ala 


Asn 


Arg 


Prri 
IT IT L> 


Vdl 


IT A 

lie 


Al ^ 
















25 
















Ala 


Glu 


lie 


Asn 


Asp 


Arg 


Arg 




Glu 


Gly Asp 


Leu 


Arg 


Pill 
uiU 


Asn 


Pi \7 

biy 






35 










40 










45 








Gly 


Tyr 


His 


TV 1 

Ala 


TV "1 

Aia 


Arg 


pi »i 


pi ii 


Gin 


Gly Gin 


Pin 


p 1 n 


7\ 1 -i 
r\J- d 


rli. y 


Tip 
j. j. t; 


50 






























Arg 


Gin 


Leu 


Gin 


Asp 


Leu 


Leu 


oer 


Asn 


Ala 


Lys 


Vdx 


P 1 \/ 


PI n 


Ala 
nld 


D y- r> 


65 










70 










75 










80 


Lys 


Gin 


Ser 


Gly 


Val 


Ala 


Leu 


Pro 


Gly Ser 


Val 


val 


Lys 


Vdl 


iyr 


iyr 








o b 










90 














Asn 


Gly 


T\ _ _ 

Asp 


Lys 
i nn 


Ser 


Asp 


oer 


Pin 


Thr 
105 


Phe 


Leu 




AT a 


Th r 

110 


rt J- y 


PI n 


Glu 


Gly 


Val 


Ser 


Asp 


Gly 


Lys 


Leu 


Glu 


Val 


Tyr 


Ser 


Pro 


Asn 


Ser 


Pro 




115 










120 










125 








Leu 


Gly 
130 


Gly 


Ala 


Leu 


He 


Asp 
135 


Ala 


Lys 


Val 


Gly 


Glu 
140 


Thr 


Arg 


Ser 


Tyr 


Thr 


Val 


Pro 


Asn 


Gly 


Ser 


Thr 


Val 


Ser 


Val 


Thr 


Leu 


Val 


Ser 


Ala 


Glu 


145 










150 










155 










160 


Pro 


Tyr 


His 


Ser 



























<210> 13 
<211> 558 
<212> DNA 

<213> M. Tuberculosis 



<220> 

<221> CDS 

<222> (1}... (555) 



<400> 13 

atg att gat gag get etc ttc gac gec gaa gag aaa atg gag aag get 48 

Met lie Asp Glu Ala Leu Phe Asp Ala Glu Glu Lys Met Glu Lys Ala 

15 10 15 

gtg gcg gtg gca cgt gac gac ctg tea act ate cgt acc ggc cgc gec 96 
Val Ala Val Ala Arg Asp Asp Leu Ser Thr He Arg Thr Gly Arg Ala 
20 25 30 



aac cct ggc atg ttc tct egg ate acc ate gac tac tac ggt gcg gec 144 
Asn Pro Gly Met Phe Ser Arg He Thr He Asp Tyr Tyr Gly Ala Ala 
35 40 45 



acc ccg ate acg caa ctg gec age ate aat gtc ccc gag gcg egg eta 192 
Thr Pro He Thr Gin Leu Ala Ser He Asn Val Pro Glu Ala Arg Leu 
50 55 60 



gtc gtg ata aag ccg tat gaa gec aat cag ttg cgc get ate gag act 240 
Val Val He Lys Pro Tyr Glu Ala Asn Gin Leu Arg Ala He Glu Thr 
65 70 75 80 



gca att cgc aac tec gac ctt gga gtg aat ccc acc aac gac ggc gec 288 
Ala He Arg Asn Ser Asp Leu Gly Val Asn Pro Thr Asn Asp Gly Ala 
85 90 95 

ctt att cgc gtg gec gta ccg cag etc acc gaa gaa cgt egg cga gag 336 



WO 00/21983 



PCT/DK99/00538 



Leu He Arg Val Ala Val Pro Gin Leu Thr Glu Glu Arg Arg Arg Glu 
100 105 110 

ctg gtc aaa cag gca aag cat aag ggg gag gag gcc aag gtt teg gtg 384 
Leu Val Lys Gin Ala Lys His Lys Gly Glu Glu Ala Lys Val Ser Val 
115 120 125 

cgt aat ate cgt cgc aaa gcg atg gag gaa etc cat cgc ate cgt aag 4 32 

Arg Asn He Arg Arg Lys Ala Met Glu Glu Leu His Arg He Arg Lys 
130 135 . 140 

gaa ggc gag gcc ggc gag gat gag gtc ggt cgc gca gaa aag gat etc 480 
Glu Gly Glu Ala Gly Glu Asp Glu Val Gly Arg Ala Glu Lys Asp Leu 



145 



150 155 160 



gac aag acc acg cac caa tac gtc ace caa att gat gag ctg gtt aaa 

Asp Lys Thr Thr His Gin Tyr Val Thr Gin He Asp Glu Leu Val Lys 
165 170 175 

cac aaa gaa ggc gag ctg ctg gag gtc tag 

His Lys Glu Gly Glu Leu Leu Glu Val 

180 185 



528 



558 



<210> 14 
<211> 185 
<212> PRT 

<213> M. Tuberculosis 
<400> 14 

Met He Asp Glu Ala Leu Phe Asp Ala Glu Glu Lys Met Glu Lys Ala 

1 5 10 15 

Val Ala Val Ala Arg Asp Asp Leu Ser Thr He Arg Thr Gly Arg Ala 

20 " 25 30 

Asn Pro Gly Met Phe Ser Arg He Thr He Asp Tyr Tyr Gly Ala Ala 

35 4 0 4 5 

Thr Pro He Thr Gin Leu Ala Ser He Asn Val Pro Glu Ala Arg Leu 

50 55 60 

Val Val He Lys Pro Tyr Glu Ala Asn Gin Leu Arg Ala He Glu Thr 
65 70 75 80 

Ala He Arg Asn Ser Asp Leu Gly Val Asn Pro Thr Asn Asp Gly Ala 

85 90 95 

Leu He Arg Val Ala Val Pro Gin Leu Thr Glu Glu Arg Arg Arg Glu 

100 105 HO 

Leu Val Lys Gin Ala Lys His Lys Gly Glu Glu Ala Lys Val Ser Val 

115 120 125 

Arg Asn He Arg Arg Lys Ala Met Glu Glu Leu His Arg He Arg Lys 

130 135 140 

Glu Gly Glu Ala Gly Glu Asp Glu Val Gly Arg Ala Glu Lys Asp Leu 
145 150 155 160 

Asp Lys Thr Thr His Gin Tyr Val Thr Gin He Asp Glu Leu Val Lys 

165 . 170 175 

His Lys Glu Gly Glu Leu Leu Glu Val 
180 185 

<210> 15 
<211> 651 
<212> DNA 

<213> M. Tuberculosis 



<220> 



WO 00/21983 



10 



PCT/DK99/00538 



<221> CDS 

<222> (1) . . . (648) 

<400> 15 

atg gcc gac ate gat ggt gta acc ggt teg gcg ggt ctg cag cct ggg 48 

Met Ala Asp He Asp Gly Val Thr Gly Ser Ala Gly Leu Gin Pro Gly 
1 5 10 15 

ccg tct gag gag aca gac gag gag ttg acc gcg cgt ttc gag cgc gac 96 
Pro Ser Glu Glu Thr Asp Glu Glu Leu Thr Ala Arg Phe Glu Arg Asp 
20 25 30 

gcg att ccc ctg ttg gac cag ctg tac ggc ggt gcg ctg egg atg acg 144 
Ala He Pro Leu Leu Asp Gin Leu Tyr Gly Gly Ala Leu Arg Met Thr 
35 40 45 

cgc aat ccg gcc gac gcc gag gac ttg etc cag gag acg atg gtg aag 192 
Arg Asn Pro Ala Asp Ala Glu Asp Leu Leu Gin Glu Thr Met Val Lys 
50 55 60 

gcc tat gcg gga ttt cgt teg ttc egg cac ggt acc aat etc aag gcc 240 
Ala Tyr Ala Gly Phe Arg Ser Phe Arg His Gly Thr Asn Leu Lys Ala 
65 70 75 80 

tgg etc tac egg ata ctg acc aac acc tac ate aac age tat cgc aag 288 
Trp Leu Tyr Arg He Leu Thr Asn Thr Tyr He Asn Ser Tyr Arg Lys 
85 90 95 

aaa cag egg caa ccg gcg gag tat ccg acc gag cag ate acc gat tgg 336 
Lys Gin Arg Gin Pro Ala Glu Tyr Pro Thr Glu Gin lie Thr Asp Trp 
100 105 110 

caa ctg gcg tec aac gcc gag cat tec teg acc ggg ctg cgc teg get 384 
Gin Leu Ala Ser Asn Ala Glu His Ser Ser Thr Gly Leu Arg Ser Ala 
115 120 125 

gaa gtc gaa gcg tta gaa gcg ttg ccg gac acc gag ate aaa gag gcg 432 
Glu Val Glu Ala Leu Glu Ala Leu Pro Asp Thr Glu He Lys Glu Ala 
130 135 140 

ctg cag gca ttg ccg gaa gag ttc egg atg gcg gtc tac tac gcc gat 480 
Leu Gin Ala Leu Pro Glu Glu Phe Arg Met Ala Val Tyr Tyr Ala Asp 
145 150 155 " * 160 

gtc gaa ggt ttc ccc tac aag gag ate gcc gag ate atg gat act ccg 528 
Val Glu Gly Phe Pro Tyr Lys Glu He Ala Glu He Met Asp Thr Pro 
165 170 175 

ate ggc acc gtg atg teg agg ctt cat cgc ggc cga cgt cag ttg cgc 57 6 

He Gly Thr Val Met Ser Arg Leu His Arg Gly Arg Arg Gin Leu Arg 
180 185 190 

ggt ctt tta gcc gat gtg gcc agg gat egg ggg ttt gcc agg ggc gag 624 
Gly Leu Leu Ala Asp Val Ala Arg Asp Arg Gly Phe Ala Arg Gly Glu 
195 200 205 



cag gcg cac gag ggg gtg teg tea tga 
Gin Ala His Glu Gly Val Ser Ser 
210 215 



651 
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<210> 16 
<211> 216 
<212> PRT 

<213> M. Tuberculosis 



<400> 16 



Met 


Ala 


Asp 


He 


Asp 


Gly 


Val 


Thr 


Gly 


Ser 


Ala Gly Leu Gin 


Pro 


biy 


1 






5 










10 








1 D 




Pro 


Ser 


Glu 


Glu 


Thr 


Asp 


Glu 


Glu 


Leu 


Thr 


Ala Arg 


Phe 


Glu 


Arg 


Asp 






20 










25 








30 






Ala 


He 


Pro 


Leu 


Leu 


Asp 


Gin Leu Tyr Gly Gly Ala 


Leu Arg 


Met 


Thr 






35 










40 








45 




Val 




Arg 


Asn 


Pro 


Ala 


Asp 


Ala 


Glu Asp 


Leu 


Leu 


Gin Glu 


Thr 


Met 


Lys 


50 










55 








60 










Ala Tyr Ala Gly 


Phe 


Arg 


Ser 


Phe 


Arg 


His 


Gly Thr Asn 


Leu 


Lys 


Ala 


65 










70 










75 








80 


Trp 


Leu 


Tyr Arg 


He 


Leu 


Thr 


Asn 


Thr 


Tyr 


He Asn 


Ser 


Tyr 


Arg 


Lys 








85 










90 








95 




Lys 


Gin Arg 


Gin 


Pro 


Ala 


Glu 


Tyr 


Pro 


Thr 


Glu Gin 


He 


Thr 


Asp 


Trp 






100 










105 








110 






Gin 


Leu 


Ala 


Ser 


Asn 


Ala 


Glu 


His 


Ser 


Ser 


Thr Gly 


Leu 


Arg 


Ser 


Ala 




115 










120 








125 








Glu 


Val 
130 


Glu 


Ala 


Leu 


Glu 


Ala 
135 


Leu 


Pro 


Asp 


Thr Glu 
140 


He 


Lys 


Glu 


Ala 


Leu 


Gin 


Ala 


Leu 


Pro 


Glu 


Glu 


Phe 


Arg 


Met 


Ala Val 


Tyr 


Tyr 


Ala 


Asp 


145 










150 










155 








160 


Val 


Glu 


Gly 


Phe 


Pro 


Tyr 


Lys 


Glu 


He 


Ala 


Glu He 


Met 


Asp 


Thr 


Pro 








165 










170 








175 




He 


Gly 


Thr 


Val 


Met 


Ser 


Arg 


Leu 


His Arg Gly Arg Arg 


Gin 


Leu 


Arg 






180 










185 








190 






Gly 


Leu 


Leu 


Ala 


Asp 


Val 


Ala Arg Asp Arg Gly Phe 


Ala 


Arg 


Gly Glu 




195 










200 








205 








Gin 


Ala 


His 


Glu 


Gly 


Val 


Ser 


Ser 

















210 215 



<210> 17 
<211> 774 
<212> DNA 

<213> M. Tuberculosis 



<220> 

<221> CDS 

<222> (1) . . . (771) 



<400> 17 

atg acg tac gaa acc ate ctg gtc gag cgc gat cag cga gtt ggc att 
Met Thr Tyr Glu Thr He Leu Val Glu Arg Asp Gin Arg Val Gly He 
15 10 15 

ate acg ctg aac cgt ccc cag gca ctg aac gcg etc aac age cag gtg 
He Thr Leu Asn Arg Pro Gin Ala Leu Asn Ala Leu Asn Ser Gin Val 
20 ~ 25 30 

atg aac gag gtc acc age get gca acc gaa ctg gac gat gac ccg gac 
Met Asn Glu Val Thr Ser Ala Ala Thr Glu Leu Asp Asp Asp Pro Asp 
35 40 45 



48 



96 



144 



att ggg gcg ate ate ate acc ggt teg gee aaa gcg ttt gee gee gga 192 
He Gly Ala He He He Thr Gly Ser Ala Lys Ala Phe Ala Ala Gly 
50 55 60 
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gcc gac ate aaa gaa atg gec gac ctg acg ttc gec gac gcg ttc acc 240 
Ala Asp lie Lys Glu Met Ala Asp Leu Thr Phe Ala Asp Ala Phe Thr 
65 70 75 80 

gcc gac ttc ttc gcc acc tgg ggc aag ctg gcc gcc gtg cgc acc ccg 288 
Ala Asp Phe Phe Ala Thr Trp Gly Lys Leu Ala Ala Val Arg Thr Pro 
85 90 95 

acg ate gcc gcg gtg gcg gga tac gcg etc ggc ggt ggc tgc gag ctg 336 
Thr lie Ala Ala Val Ala Gly Tyr Ala Leu Gly Gly Gly Cys Glu Leu 
100 105 110 

gcg atg atg tgc gac gtg ctg ate gcc gcc gac acc gcg aag ttc gga 384 
Ala Met Met Cys Asp Val Leu lie Ala Ala Asp Thr Ala Lys Phe Gly 
115 120 125 

cag ccc gag ata aag ctg ggc gtg ctg cca ggc atg ggc ggc tec cag 432 
Gin Pro Glu lie Lys Leu Gly Val Leu Pro Gly Met Gly Gly Ser Gin 
130 135 140 

egg ctg acc egg get ate ggc aag get aag gcg atg gac etc ate ctg 480 
Arg Leu Thr Arg Ala lie Gly Lys Ala Lys Ala Met Asp Leu lie Leu 
145 150 155 160 

acc ggg cgc acc atg gac gcc gcc gag gcc gag cgc age ggt ctg gtt 528 
Thr Gly Arg Thr Met Asp Ala Ala Glu Ala Glu Arg Ser Gly Leu Val 
165 170 ^ 175 

tea egg gtg gtg ccg gcc gac gac ttg ctg acc gaa gcc agg gcc act 576 
Ser Arg Val Val Pro Ala Asp Asp Leu Leu Thr Glu Ala Arg Ala Thr 
180 185 190 

gcc acg acc att teg cag atg teg gcc teg gcg gcc egg atg gcc aag 624 
Ala Thr Thr lie Ser Gin Met Ser Ala Ser Ala Ala Arg Met Ala Lys 
195 200 205 

gag gcc gtc aac egg get ttc gaa tec agt ttg tec gag ggg ctg etc 672 
Glu Ala Val Asn Arg Ala Phe Glu Ser Ser Leu Ser Glu Gly Leu Leu 
210 215 220 

tac gaa cgc egg ctt ttc cat teg get ttc gcg acc gaa gac caa tec 720 
Tyr Glu Arg Arg Leu Phe His Ser Ala Phe Ala Thr Glu Asp Gin Ser 
225 230 235 240 

gaa ggt atg gca gcg ttc ate gag aaa cgc get ccc cag ttc acc cac 768 
Glu Gly Met Ala Ala Phe lie Glu Lys Arg Ala Pro Gin Phe Thr His 
245 250 255 

cga tga 774 
Arg 



<210> 18 
<211> 257 
<212> PRT 

<213> M. Tuberculosis 
<400> 18 

Met Thr Tyr Glu Thr He Leu Val Glu Arg Asp Gin Arg Val Gly He 
15 10 15 
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He 


Thr 


Leu 


Asn Arg 


Pro 


Gin 


Ala 


Leu 


Asn 


Ala 


Leu 


Asn 


Ser 


Gin 


Val 








20 










25 










30 






Met 


Asn 


Glu 


Val 


Thr 


Ser 


Ala 


Ala 


Thr 


Glu 


Leu 


Asp 


Asp 


Asp 


Pro 


Asp 






35 










40 










45 








He 


Gly Ala 


He 


He 


He 


Thr 


Gly Ser Ala 


Lys 


Ala 


Phe 


Ala 


Ala 


Gly 




50 










55 










60 










Ala 


Asp 


He 


Lys 


Glu 


Met 


Ala 


Asp 


Leu 


Thr 


Phe 


Ala 


Asp 


Ala 


Phe 


Thr 


65 








70 










75 










80 


Ala 


Asp 


Phe 


Phe 


Ala 


Thr 


Trp 


Gly Lys 


Leu 


Ala 


Ala 


Val 


Arg 


Thr 


Pro 








85 










90 










95 




Thr 


He 


Ala 


Ala 


Val 


Ala 


Gly 


Tyr 


Ala 


Leu 


Gly 


Gly 


Gly 


Cys 


Glu 


Leu 








100 










105 










110 






Ala 


Met 


Met 


Cys Asp 


Val 


Leu 


He 


Ala 


Ala 


Asp 


Thr 


Ala 


Lys 


Phe 


Gly 






115 










120 










125 








Gin 


Pro 


Glu 


lie Lys 


Leu 


Gly 


Val 


Leu 


Pro 


Gly 


Met 


Gly 


Gly 


Ser 


Gin 




130 










135 










140 










Arg 


Leu 


Thr 


Arg Ala 


He 


Gly 


Lys 


Ala 


Lys 


Ala 


Met 


Asp 


Leu 


He 


Leu 


145 










150 










155 










160 


Thr 


Gly Arg 


Thr 


Met 


Asp 


Ala 


Ala 


Glu 


Ala 


Glu 


Arg 


Ser 


Gly 


Leu 


Val 










165 










170 










175 




Ser 


Arg 


Val 


Val 


Pro 


Ala 


Asp 


Asp 


Leu 


Leu 


Thr 


Glu 


Ala 


Arg 


Ala 


Thr 






180 










185 










190 






Ala 


Thr 


Thr 


He 


Ser 


Gin 


Met 


Ser 


Ala 


Ser 


Ala' 


Ala 


Arg 


Met 


Ala 


Lys 






195 










200 










205 








Glu 


Ala 


Val 


Asn 


Arg 


Ala 


Phe 


Glu 


Ser 


Ser 


Leu 


Ser 


Glu 


Gly 


Leu 


Leu 




210 










215 










220 










Tyr 


Glu 


Arg 


Arg 


Leu 


Phe 


His 


Ser 


Ala 


Phe 


Ala 


Thr 


Glu 


Asp 


Gin 


Ser 


225 










230 










235 










240 


Glu 


Gly Met 


Ala 


Ala 


Phe 


He 


Glu 


Lys 


Arg 


Ala 


Pro 


Gin 


Phe 


Thr 


His 










245 










250 










255 





Arg 



N 

<210> 19 

<211> 894 

<212> DNA 

<213> M. Tuberculosis 

<220> 

<221> CDS 

<222> (1)...<891) 

<400> 19 

gtg ccg ctt ccc gca gac cct age ccc ace ttg teg gec tac gee cat 48 

Val Pro Leu Pro Ala Asp Pro Ser Pro Thr Leu Ser Ala Tyr Ala His 
1 5 .10 15 

ccc gaa egg etc gtg acc gec gac tgg ttg teg gca cac atg ggc gcg 96 
Pro Glu Arg Leu Val Thr Ala Asp Trp Leu Ser Ala His Met Gly Ala 
20 25 30 

ccg ggc ctg gcg ate gtc gaa tec gac gag gac gtc.ttg etc tac gac 144 
Pro Gly Leu Ala He Val Glu Ser Asp Glu Asp Val Leu Leu Tyr Asp 
35 40 45 

gtc ggc cat att ccc ggc gec gtc aag ate gac tgg cac acc gac etc 192 
Val Gly His He Pro Gly Ala Val Lys He Asp Trp His Thr Asp Leu 
50 55 . 60 

aac gac cca egg gtg cgc gac tac ate aac ggc gag cag ttc gee gaa 24 0 

Asn Asp Pro Arg Val Arg Asp Tyr He Asn Gly Glu Gin Phe Ala Glu 
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65 70 75 80 

ttg atg gac cgc aag ggc ate gec cgc gat gac acc gtg gtg ate tat 288 
Leu Met Asp Arg Lys Gly lie Ala Arg Asp Asp Thr Val Val lie Tyr 
85 90 95 

ggc gac aag age aat tgg tgg gee gee tat gcg ttg tgg gtg ttc acq 336 
Gly Asp Lys Ser Asn Trp Trp Ala Ala Tyr Ala Leu Trp Val Phe Thr 
100 105 110 

ctg ttc ggt cac gee gac gtg cga etc etc aac ggc ggc cgt gac etc 384 
Leu Phe Gly His Ala Asp Val Arg Leu Leu Asn Gly Gly Arg Asp Leu 
115 120 " 125 

tgg etc gee gag cgc egg gaa acc acc ttg gac gtc ccg acc aag acc 432 
Trp Leu Ala Glu Arg Arg Glu Thr Thr Leu Asp Val Pro Thr Lys Thr 
130 135 140 

tgc acc ggt tat ccc gtc gtg cag cgc aac gat gca ccc ate cgc gca 480 
Cys Thr Gly Tyr Pro Val Val Gin Arg Asn Asp Ala Pro lie Arg Ala 
145 150 155 160 

ttc aga gac gac gtg ctg gee ate ctg ggc get cag ccg ctg ate gac 528 
Phe Arg Asp Asp Val Leu Ala He Leu Gly Ala Gin Pro Leu He Asp 
165 170 175 

gta cgc tct ccc gag gag tac acc ggc aag cgc acc cat atg ccc gat 57 6 

Val Arg Ser Pro Glu Glu Tyr Thr Gly Lys Arg Thr His Met Pro Asp 
180 185 190 

tac ccc gag gaa ggg gcg ctg egg gec ggt cac ate ccc acg gcg gtg 624 
Tyr Pro Glu Glu Gly Ala Leu Arg Ala Gly His He Pro Thr Ala Val 
195 200 205 

cac att ccg tgg ggg aag gee gee gac gaa agt gga egg ttt cgc age 672 
His He Pro Trp Gly Lys Ala Ala Asp Glu Ser Gly Arg Phe Arg Ser 
210 215 220 

cgc gag gaa ttg gaa egg etc tat gac ttc ata aac ccg gac gac caa 720 
Arg Glu Glu Leu Glu Arg Leu Tyr Asp Phe He Asn Pro Asp Asp Gin 
225 230 235 240 

acc gtc gtc tat tgc cgc ate ggt gaa cgc tec age cat acc tgg ttc 768 
Thr Val Val Tyr Cys Arg He Gly Glu Arg Ser Ser His Thr Trp Phe 
245 250 255 

gtg etc aca cac ctg ctg ggc aag gca gat gta egg aac tac gac ggc 816 
Val Leu Thr His Leu Leu Gly Lys Ala Asp Val Arg Asn Tyr Asp Gly 
260 265 270 

teg tgg acc gag tgg ggc aac gee gtg cga gtg ccg ate gtc gcg ggc 864 
Ser Trp Thr Glu Trp Gly Asn Ala Val Arg Val Pro He Val Ala Gly 
275 280 285 

gaa gaa cca gga gtg gta ccc gtc gta tga 894 
Glu Glu Pro Gly Val Val Pro Val Val 
290 295 



<210> 20 
<211> 297 
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15 



PCT/DK99/00538 



<212> PRT 

<213> M. Tuberculosis 



<400> 20 



Met 


Pro 


Leu 


Pro 


Ala 


Asp 


Pro 


Ser 


1 








5 








Pro 


Glu 


Arg 


Leu 
20 


Val 


Thr 


Ala 


Asp 


Pro 


Gly 


Leu 
35 


Ala 


He 


Val 


Glu 


Ser 
40 


Val 


Gly 


His 


He 


Pro 


Gly Ala 


Val 




50 










55 




Asn 


Asp 


Pro 


Arg 


Val 


Arg Asp Tyr 


65 










70 






Leu 


Met 


Asp 


Arg 


Lys 
85 


Gly 


lie 


Ala 


Gly 


Asp 


Lys 


Ser 
100 


Asn 


Trp 


Trp 


Ala 


Leu 


Phe 


Gly 
115 


His 


Ala 


Asp 


Val 


Arg 
120 


Trp 


Leu 
130 


Ala 


Glu 


Arg 


Arg 


Glu 
135 


Thr 


Cys 


Thr 


Gly 


Tyr 


Pro 


Val 


Val 


Gin 


145 










150 






Phe 


Arg 


Asp 


Asp 


Val 
165 


Leu 


Ala 


He 


Val 


Arg 


Ser 


Pro 
180 


Glu 


Glu 


Tyr 


Thr 


Tyr 


Pro 


Glu 


Glu 


Gly 


Ala 


Leu Arg 




195 










200 


His 


He 
210 


Pro 


Trp 


Gly 


Lys 


Ala 
215 


Ala 


Arg 


Glu 


Glu 


Leu 


Glu 


Arg 


Leu 


Tyr 


225 










230 






Thr 


Val 


Val 


Tyr 


Cys 
245 


Arg 


He 


Gly 


Val 


Leu 


Thr 


His 


Leu 


Leu Gly Lys 








260 










Ser 


Trp 


Thr 


Glu 


Trp 


Gly Asn 


Ala 






275 










280 


Glu 


Glu 
290 


Pro 


Gly 


Val 


Val 


Pro 
295 


Val 



Pro 


Thr Leu 


Ser Ala 


Tyr 


Ala His 




10 






15 


Trp 


Leu Ser 


Ala His 


Met 


Gly Ala 


25 






30 




Asp 


Glu Asp 


Val Leu 


Leu 


Tyr Asp 






45 






Lys 


He Asp 


Trp His 


Thr 


Asp Leu 






60 






He 


Asn Gly Glu Gin 


Phe 


Ala Glu 




75 






80 


Arg 


Asp Asp 


Thr Val 


Val 


He Tyr 




90 






95 


Ala 


Tyr Ala 


Leu Trp Val 


Phe Thr 


105 






110 




Leu 


Leu Asn Gly Gly Arg 


Asp Leu 






125 






Thr 


Leu Asp 


Val Pro 


Thr 


Lys Thr 






140 






Arg 


Asn Asp' Ala Pro 


He 


Arg Ala 




155 






160 


Leu 


Gly Ala 


Gin Pro 


Leu 


He Asp 




170 






175 


Gly 


Lys Arg 


Thr His 


Met 


Pro Asp 


185 






190 




Ala 


Gly His 


He Pro 


Thr 


Ala Val 






205 






Asp 


Glu Ser 


Gly Arg 


Phe 


Arg Ser 






220 






Asp 


Phe He 


Asn Pro 


Asp 


Asp Gin 




235 






240 


Glu 


Arg Ser 


Ser His 


Thr 


Trp Phe 




250 






255 


Ala 


Asp Val 


Arg Asn 


Tyr 


Asp Gly 


265 






270 




Val 


Arg Val 


Pro He 


Val 


Ala Gly 






285 






Val 











<210> 21 
<211> 1044 
<212> DNA 

<213> M.Tuberculosis 

<220> 

<221> CDS 

<222> (1) . . • (1041) 



<400> 21 

atg ctg ate tea cag cgc ccc acc ctg tec gag gac gtc etc acc gac 48 

Met Leu He Ser Gin Arg Pro Thr Leu Ser Glu Asp Val Leu Thr Asp 
1 5 10 15 

aac cga tec cag ttc gtg ate gaa ccg ctg gag ccg gga ttc ggc tac 96 

Asn Arg Ser Gin Phe Val He Glu Pro Leu Glu Pro Gly Phe Gly Tyr 
20 25 30 
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acc ctg ggc aat teg ctg cgt cgc acc ctg ctg teg teg att ccc gga 144 
Thr Leu Gly Asn Ser Leu Arg Arg Thr Leu Leu Ser Ser lie Pro Gly 
35 40 45 

gcg gee gtc acc age att cgc ate gat ggt gta ctg cac gaa ttc acc 192 
Ala Ala Val Thr Ser He Arg He Asp Gly Val Leu His Glu Phe Thr 
50 55 60 

acg gtg ccc ggg gtc aaa gaa gat gtc acc gag ate ate ctg aat etc 240 
Thr Val Pro Gly Val Lys Glu Asp Val Thr Glu He He Leu Asn Leu 
65 70 75 80 

aag age ctg gtg gtg tec teg gag gag gac gag ccg gtc acc atg tac 288 
Lys Ser Leu Val Val Ser Ser Glu Glu Asp Glu Pro Val Thr Met Tyr 
85 90 95 

eta cgc aag cag ggt ccg ggt gag gtt acc gee ggc gac ate gtg ccg 336 
Leu Arg Lys Gin Gly Pro Gly Glu Val Thr Ala Gly Asp He Val Pro 
100 105 " 110 

ccg gee ggc gtc acc gtg cac aac ccc ggc atg cac ate gee acg ctg 384 
Pro Ala Gly Val Thr Val His Asn Pro Gly Met His He Ala Thr Leu 
115 120 125 

aac gat aag ggc aag ctg gaa gtc gag etc gtc gtc gag cgt ggc cgc 432 
Asn Asp Lys Gly Lys Leu Glu Val Glu Leu Val Val Glu Arg Gly Arg 
130 135 140 

ggc tat gtc ccg gcg gtg caa aac egg get teg ggt gee gaa att ggg 480 
Gly Tyr Val Pro Ala Val Gin Asn Arg Ala Ser Gly Ala Glu He Gly 
145 150 155 160 

cgc att cca gtc gat tec ate tac tea ccg gtg etc aaa gtg acc tac 528 
Arg He Pro Val Asp Ser He Tyr Ser Pro Val Leu Lys Val Thr Tyr 
165 170 175 

aag gtg gac gec acc egg gtc gag cag cgc acc gac ttc gac aag ctg 576 
Lys Val Asp Ala Thr Arg Val Glu Gin Arg Thr Asp Phe Asp Lys Leu 
180 185 190 

ate ctg gac gtg gag acc aag aat tea ate age ccg cgc gac gcg ctg 624 
lie Leu Asp Val Glu Thr Lys Asn Ser He Ser Pro Arg Asp Ala Leu 
195 200 205 

gcg teg get ggc aag acg ctg gtc gag ttg ttc ggc ctg gca egg gaa 672 
Ala Ser Ala Gly Lys Thr Leu Val Glu Leu Phe Gly Leu Ala Arg Glu 
210 215 220 

etc aac gtc gag gee gaa ggc ate gag ate ggg ccg teg ccg gee gag 720 
Leu Asn Val Glu Ala Glu Gly He Glu He Gly Pro Ser Pro Ala Glu 
225 230 235 240 

gee gat cac att gcg tea ttc gec ctg ccg ate gac gac ctg gat ctg 768 
Ala Asp His He Ala Ser Phe Ala Leu Pro He Asp Asp Leu Asp Leu 
245 250 255 

acg gtg egg tec tac aac tgc etc aag cgc gag ggg gtg cac acc gtg 816 
Thr Val Arg Ser Tyr Asn Cys Leu Lys Arg Glu Gly Val His Thr Val 
260 265 270 



ggc gaa ctg gtg gcg cgc acc gaa tec gac ctg ctt gac ate cgc aac 



864 
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Gly Glu Leu Val Ala Arg Thr Glu Ser Asp Leu Leu Asp lie Arg Asn 

275 280 285 

ttc ggt cag aag tec ate gac gag gtg aag ate aag ctg cac cag ctg 912 

Phe Gly Gin Lys Ser lie Asp Glu Val Lys He Lys Leu His Gin Leu 
290 295 300 

ggc ctg tea etc aag gac age ccg ccg age ttc gac ccc teg gag gtc 

Gly Leu Ser Leu Lys Asp Ser Pro Pro Ser Phe Asp Pro Ser Glu Val 
305 ^ 310 315 320 

gcg ggc tac gac gtc gec ace ggc acc tgg teg ace gag ggc gcg tac 

Ala Gly Tyr Asp Val Ala Thr Gly Thr Trp Ser Thr Glu Gly Ala Tyr 

325 330 335 

gac gag cag gac tac gee gaa acc gaa cag ctt tag 1044 

Asp Glu Gin Asp Tyr Ala Glu Thr Glu Gin Leu 

340 345 



960 



1008 



<210> 22 
<211> 347 
<212> PRT 

<213> M. Tuberculosis 



<400> 22 



Met 


Leu 


He 


Ser 


Gin 


Arg 


Pro 


Thr 


Leu 


Ser 


Glu 


Asp 


Val 


Leu 


Thr 


Asp 


1 








5 










10 










15 




Asn Arg 


Ser 


Gin 


Phe 


Val 


He 


Glu 


Pro 


Leu 


Glu 


Pro 


Gly 


Phe 


Gly 


Tyr 








20 










25 










30 






Thr 


Leu 


Gly 


Asn 


Ser 


Leu 


Arg 


Arg Thr 


Leu 


Leu 


Ser 


Ser 


He 


Pro 


Gly 






35 










40 










4 5 








Ala 


Ala 
50 


Val 


Thr 


Ser 


He 


Arg 
55 


He 


Asp 


Gly 


Val 


Leu 
60 


His 


Glu 


Phe 


Thr 


Thr 


Val 


Pro 


Gly 


Val 


Lys 


Glu 


Asp 


Val 


Thr 


Glu 


He 


lie* 


Leu 


Asn 


Leu 


65 








70 










75 










80 


Lys 


Ser 


Leu 


Val 


Val 


Ser 


Ser 


Glu 


Glu 


Asp 


Glu 


Pro 


Val 


Thr 


Met 


Tyr 








85 










90 










95 




Leu 


Arg 


Lys 


Gin 


Gly 


Pro 


Gly 


Glu 


Val 


Thr 


Ala 


Gly 


Asp 


He 


Val 


Pro 






100 










105 










110 






Pro 


Ala 


Gly 
115 


Val 


Thr 


Val 


His 


Asn 
120 


Pro 


Gly 


Met 


His 


He 
125 


Ala 


Thr 


Leu 


Asn Asp 


Lys 


Gly 


Lys 


Leu 


Glu 


Val 


Glu 


Leu 


Val 


Val 


Glu 


Arg 


Gly 


Arg 




130 










135 










140 










Gly Tyr 


Val 


Pro 


Ala 


Val 


Gin 


Asn Arg 


Ala 


Ser 


Gly 


Ala 


Glu 


He 


Gly 


145 










150 










155 










160 


Arg 


He 


Pro 


Val 


Asp 


Ser 


He 


Tyr 


Ser 


Pro 


Val 


Leu 


Lys 


Val 


Thr 


Tyr 








165 










170 










175 




Lys 


Val 


Asp 


Ala 


Thr 


Arg 


Val 


Glu 


Gin 


Arg 


Thr 


Asp 


Phe 


Asp 


Lys 


Leu 






180 










185 










190 






He 


Leu 


Asp 
195 


Val 


Glu 


Thr 


Lys 


Asn 
200 


Ser 


He 


Ser 


Pro 


Arg 
205 


Asp 


Ala 


Leu 


Ala 


Ser 


Ala 


Gly 


Lys 


Thr 


Leu 


Val 


Glu 


Leu 


Phe 


Gly 


Leu 


Ala 


Arg 


Glu 




210 






215 










220 










Leu 


Asn 


Val 


Glu 


Ala 


Glu 


Gly 


He 


Glu 


lie 


Gly 


Pro 


Ser 


Pro 


Ala 


Glu 


225 










230 










235 










240 


Ala 


Asp 


His 


He 


Ala 


Ser 


Phe 


Ala 


Leu 


Pro 


He 


Asp 


Asp 


Leu 


Asp 


Leu 








245 










250 










255 




Thr Val 


Arg 


Ser 


Tyr 


Asn 


Cys 


Leu 


Lys 


Arg 


Glu 


Gly 


Val 


His 


Thr 


Val 








260 










265 










270 






Gly Glu 


Leu 


Val 


Ala 


Arg 


Thr 


Glu 


Ser 


Asp 


Leu 


Leu 


Asp 


He 


Arg 


Asn 
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275 280 285 

Phe Gly Gin Lys Ser He Asp Glu Val Lys He Lys Leu His Gin Leu 

290 295 300 

Gly Leu Ser Leu Lys Asp Ser Pro Pro Ser Phe Asp Pro Ser Glu Val 
305 310 315 320 

Ala Gly Tyr Asp Val Ala Thr Gly Thr Trp Ser Thr Glu Gly Ala Tyr 

325 330 335 

Asp Glu Gin Asp Tyr Ala Glu Thr Glu Gin Leu 
340 345 

<210> 23 
<211> 1488 
<212> DNA 

<213> M. Tuberculosis 



<220> 
<221> CDS 
<222> (1), 



(1485) 



<400> 23 

atg acc gga aat ttg gtg acc aaa aat teg ctg acc cct gac gtt cgt 

Met Thr Gly Asn Leu Val Thr Lys Asn Ser Leu Thr Pro Asp Val Arg 
15 10 15 



48 



aac ggc ate gac ttt aag ate gec gac ctg tea eta gcg gat ttc ggc 
Asn Gly He Asp Phe Lys He Ala Asp Leu Ser Leu Ala Asp Phe Gly 
20 25 30 



96 



cgc aaa gaa etc egg ate gee gag cac gag atg ccc ggc ctg atg teg 
Arg Lys Glu Leu Arg lie Ala Glu His Glu Met Pro Gly Leu Met Ser 
35 40 45 



144 



ctg egg cgc gag tat gec gag gtg caa ccc ctg aag ggg gec egg ate 
Leu Arg Arg Glu Tyr Ala Glu Val Gin Pro Leu Lys Gly Ala Arg He 
50 55 60 



192 



teg ggt teg ctg cac atg acg gtg cag acc gcg gtg ttg ate gaa acc 
Ser Gly Ser Leu His Met Thr Val Gin Thr Ala Val Leu He Glu Thr 
65 70 75 80 



240 



etc acc gcg ctg ggc gee gaa gtc cgc tgg gee teg tgc aac ate ttc 
Leu Thr Ala Leu Gly Ala Glu Val Arg Trp Ala Ser Cys Asn He Phe 
85 90 " 95 



288 



tec acc cag gat cac gec gee gee gec gtc gtg gtc ggc ccg cac ggc 
Ser Thr Gin Asp His Ala Ala Ala Ala Val Val Val Gly Pro His Gly 
100 105 110 



336 



acc ccc gac gag ccc aag ggt gtc ccg gtg ttc gcg tgg aag ggc gag 
Thr Pro Asp Glu Pro Lys Gly Val Pro Val Phe Ala Trp Lys Gly Glu 
115 120 125 



384 



acg etc gaa gag tac tgg tgg gec gec gag cag atg etc acc tgg ccg 
Thr Leu Glu Glu Tyr Trp Trp Ala Ala Glu Gin Met Leu Thr Trp Pro 
130 135 140 



432 



gac ccc gac aag ccg gee aac atg ate etc gat gac ggc ggt gac gee 
Asp Pro Asp Lys Pro Ala Asn Met He Leu Asp Asp Gly Gly Asp Ala 
145 150 155 ' 160 



480 



acc atg ttg gtg ctg cgc ggc atg cag tat gag aag gec ggc gtg gtg 



528 
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Thr Met Leu Val Leu Arg Gly Met Gin Tyr Glu Lys Ala Gly Val Val 
165 170 175 

ccg ccc gcc gag gag gac gac ccc gcc gag tgg aag gtc ttc ctg aac 576 
Pro Pro Ala Glu Glu Asp Asp Pro Ala Glu Trp Lys Val Phe Leu Asn 
180 * 185 1.90 

ctg eta egg ace cgc ttc gag acc gac aag gac aag tgg acc aag ata 624 
Leu Leu Arg Thr Arg Phe Glu Thr Asp Lys Asp Lys Trp Thr Lys lie 
195 ' 200 205 

gcc gag teg gtc aag ggc gtc acc gag gag acc acc acc ggc gtg ctg 672 
Ala Glu Ser Val Lys Gly Val Thr Glu Glu Thr Thr Thr Gly Val Leu 
210 215 220 

egg etc tac caa ttc gcc gcg gcc ggg gat ctg gcc ttc ccg gcg ate 720 
Arg Leu Tyr Gin Phe Ala Ala Ala Gly Asp Leu Ala Phe Pro Ala lie 
225 230 235 240 

aac gtc aac gac teg gtg acc aag tec aaa ttc gac aac aag tac ggc 768 
Asn Val Asn Asp Ser Val Thr Lys Ser Lys Phe Asp Asn Lys Tyr Gly 
245 250 255 

act egg cac tec ctg ate gac ggc ate aac cgc ggc acc gac gcg ctg 816 
Thr Arg His Ser Leu lie Asp Gly lie Asn Arg Gly Thr Asp Ala Leu 
260 265 270 

ate ggc ggt aag aag gtc etc ate tgc ggc tac ggc gac gtc ggt aag 864 
lie Gly Gly Lys Lys Val Leu lie Cys Gly Tyr Gly Asp Val Gly Lys 
275 280 285 

ggc tgt gcg gag gcg atg aag ggc cag gga gcg egg gtc tec gtc acc . 912 
Gly Cys Ala Glu Ala Met Lys Gly Gin Gly Ala Arg Val Ser Val .Thr 
290 295 300 

gag ate gac ccg ate aac gcg ctg cag gcc atg atg gag ggc ttc gac 960 
Glu He Asp Pro He Asn Ala Leu Gin Ala Met Met Glu Gly Phe Asp 
305 310 315 320 

gtg gtc acc gtc gag gag gcc ate ggg gac gcc gac ate gtc gta acc 1008 
Val Val Thr Val Glu Glu Ala He Gly Asp Ala Asp He Val Val Thr 
325 330 335 

gcg acc ggc aac aaa gac ate ate atg etc gag cac att aag gcg atg 1056 
Ala Thr Gly Asn Lys Asp He He Met Leu Glu His He Lys Ala Met 
340 ^ 345 350 

aag gac cac gcg ate ctg gga aat ate ggc cac ttc gac aac gag ate 1104 
Lys Asp His Ala He Leu Gly Asn He Gly His Phe Asp Asn Glu He 
355 360 365 

gac atg gcc ggg ctg gag cgc tec ggg gcg aca egg gtc aac gtc aag 1152 
Asp Met Ala Gly Leu Glu Arg Ser Gly Ala Thr Arg Val Asn Val Lys 
370 375 380 

cct cag gtc gac ctg tgg acc ttt ggc gac acg ggc cgc teg ate ate 1200 
Pro Gin Val Asp Leu Trp Thr Phe Gly Asp Thr Gly Arg Ser lie He 
385 390 395 400 

gtg ctg tec gag ggg egg ctg ctg aac ctg ggc aat gcc acc ggg cac 1248 
Val Leu Ser Glu Gly Arg Leu Leu Asn Leu Gly Asn Ala Thr Gly His 
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405 410 415 

ccc teg ttc gtg atg age aac age ttc get aac cag acg ate gee cag 1296 
Pro Ser Phe Val Met Ser Asn Ser Phe Ala Asn Gin Thr He Ala Gin 
420 425 430 

ate gag ctg tgg ace aag aac gac gag tac gac aac gag gtg tac egg 1344 
He Glu Leu Trp Thr Lys Asn Asp Glu Tyr Asp Asn Glu Val Tyr Arg 
435 440 445 

ctg ccc aag cac etc gac gag aag gtg get cga ate cat gtc gag gee 1392 
Leu Pro Lys His Leu Asp Glu Lys Val Ala Arg He His Val Glu Ala 
450 455 460 

ctt ggc ggt cac ctg ace aag ctg acc aag gag cag gee gaa tac etc 1440 
Leu Gly Gly His Leu Thr Lys Leu Thr Lys Glu Gin Ala Glu Tyr Leu 
465 470 475 480 

4 

ggc gtc gac gtc gag ggt ccc tac aag ccg gac cac tac cgc tac 1485 

Gly Val Asp Val Glu Gly Pro Tyr Lys Pro Asp His Tyr Arg Tyr 

485 490 495 



tga 

<210> 24 
<211> 495 
<212> PRT 

<213> M. Tuberculosis 
<400> 24 

Met Thr Gly Asn Leu Val Thr Lys Asn Ser Leu Thr Pro Asp Val Arg 

1 5 10 15 

Asn Gly He Asp Phe Lys He Ala Asp Leu Ser Leu Ala Asp Phe Gly 

20 25 30 

Arg Lys Glu Leu Arg He Ala Glu His Glu Met Pro Gly Leu Met Ser 

35 40 45 

Leu Arg Arg Glu Tyr Ala Glu Val Gin Pro Leu Lys Gly Ala Arg lie 

50 55 60 

Ser Gly Ser Leu His Met Thr Val Gin Thr Ala Val Leu He Glu Thr 
65 70 75 80 

Leu Thr Ala Leu Gly Ala Glu Val Arg Trp Ala Ser Cys Asn lie Phe 

85 90 95 

Ser Thr Gin Asp His Ala Ala Ala Ala Val Val Val Gly Pro His Gly 

100 105 no 

Thr Pro Asp Glu Pro Lys Gly Val Pro Val Phe Ala Trp Lys Gly Glu 

115 120 125 

Thr Leu Glu Glu Tyr Trp Trp Ala Ala Glu Gin Met Leu Thr Trp Pro 

130 135 140 

Asp Pro Asp Lys Pro Ala Asn Met lie Leu Asp Asp Gly Gly Asp Ala 
145 150 155 160 

Thr Met Leu Val Leu Arg Gly Met Gin Tyr Glu Lys Ala Gly Val Val. 

165 170 175 

Pro Pro Ala Glu Glu Asp Asp Pro Ala Glu Trp Lys Val Phe Leu Asn 

180 185 190 

Leu Leu Arg Thr Arg Phe Glu Thr Asp Lys Asp Lys Trp Thr Lys lie 

195 200 205 

Ala Glu Ser Val Lys Gly Val Thr Glu Glu Thr Thr Thr Gly Val Leu 

210 215 220 

Arg Leu Tyr Gin Phe Ala Ala Ala Gly Asp Leu Ala Phe Pro Ala He 
225 230 235 240 

Asn Val Asn Asp Ser Val Thr Lys Ser Lys Phe Asp Asn Lys Tyr Gly 



1488 
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245 250 255 



Thr 


Arg 


His 


Ser 


Leu 


He 


Asp Gly 


He 


Asn 


Arg 


Gly 


Thr 


Asp 


Ala 


Leu 






260 










265 










270 






He 


Gly 


Gly 


Lys 


Lys 


Val 


Leu 


He 


Cys 


Gly 


Tyr 


Gly 


Asp 


Val 


Gly 


Lys 




275 










280 










285 








Gly 


Cys 


Ala 


Glu 


Ala 


Met 


Lys 


Gly 


Gin 


Gly 


Ala 


Arg 


Val 


Ser 


Val 


Thr 


290 










295 










300 










Glu 


He 


Asp 


Pro 


He 


Asn 


Ala 


Leu 


Gin 


Ala 


Met 


Met 


Glu 


Gly 


Phe 


Asp 


305 








310 










315 










320 


Val 


Val 


Thr 


Val 


Glu 


Glu 


Ala 


He 


Gly Asp 


Ala 


Asp 


He 


Val 


Val 


Thr 










325 










330 










335 




Ala 


Thr 


Gly 


Asn 


Lys 


Asp 


He 


He 


Met 


Leu 


Glu 


His 


lie 


Lys 


Ala 


Met 






340 










345 










350 






Lys 


Asp 


His 


Ala 


He 


Leu 


Gly Asn 


He 


Gly 


His 


Phe 


Asp 


Asn 


Glu 


He 




355 










360 










365 








Asp 


Met 


Ala 


Gly 


Leu 


Glu 


Arg 


Ser 


Gly Ala 


Thr 


Arg 


Val 


Asn 


Val 


Lys 


370 










375 










380 










Pro 


Gin 


Val 


Asp 


Leu 


Trp 


Thr 


Phe 


Gly Asp 


Thr 


Gly 


Arg 


Ser 


He 


He 


385 










390 










395 










400 


Val 


Leu 


Ser 


Glu 


Gly 


Arg 


Leu 


Leu 


Asn 


Leu 


Gly 


Asn 


Ala 


Thr 


Gly 


His 










405 










410 










415 




Pro 


Ser 


Phe 


Val 


Met 


Ser 


Asn 


Ser 


Phe 


Ala 


Asn 


Gin 


Thr 


He 


Ala 


Gin 








420 










425 










430 






He 


Glu 


Leu 


Trp 


Thr 


Lys 


Asn Asp 


Glu 


Tyr 


Asp 


Asn 


Glu 


Val 


Tyr 


Arg 






435 










440 










445 








Leu 


Pro 


Lys 


His 


Leu 


Asp 


Glu 


Lys 


Val 


Ala 


Arg 


He 


His 


Val 


Glu 


Ala 




450 








455 










460 










Leu 


Gly 


Gly 


His 


Leu 


Thr 


Lys 


Leu 


Thr 


Lys 


Glu 


Gin 


Ala 


Glu 


Tyr 


Leu 


465 










470 










475 










480 


Gly 


Val 


Asp 


Val 


Glu 


Gly 


Pro 


Tyr 


Lys 


Pro 


Asp 


His 


Tyr 


Arg 


Tyr 










485 










490 










495 





<210> 25 
<211> 1803 
<212> DNA 

<213> M. Tuberculosis 



<220> 

<221> CDS 

<222> (1) . . . (1800) 



<400> 25 

gtg get agt cac gec ggc teg agg ate get egg ate tct aag gtt etc 48 

Val Ala Ser His Ala Gly Ser Arg He Ala Arg He Ser Lys Val Leu 
15 10 15 



gtc gee aat cgc ggc gag ate gca gtg egg gtg ate egg gcg gee cgc 96 

Val Ala Asn Arg Gly Glu He Ala Val Arg Val He Arg Ala Ala Arg 
20 ' 25 30 

gac gee ggc ctg ccc age gtg gcg gtg tac gee gaa ccc gac gee gag 144 

Asp Ala Gly Leu Pro Ser Val Ala Val Tyr Ala Glu Pro Asp Ala Glu 
35 40 45 



tec ccg cat gtt egg ctg gec gac gag gcg ttc gcg ctg ggc ggc cag 192 

Ser Pro His Val Arg Leu Ala Asp Glu Ala Phe Ala Leu Gly Gly Gin 

50 , 55 60 

ace teg gcg gag tec tat ctg gac' ttc gee aag ate etc gac gcg gca 240 

Thr Ser Ala Glu Ser Tyr Leu Asp Phe Ala Lys He Leu Asp Ala Ala 

65 70 75 80 
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gcc aag tec ggg gec aac gec ate cac ccc ggc tac ggc ttc eta gcg 288 
Ala Lys Ser Gly Ala Asn Ala lie His Pro Gly Tyr Gly Phe Leu Ala 
85 90 * " 95 

gaa aat gcc gac ttc gcc cag gcg gtg ate gac gcc ggc ctg ate tgg 336 
Glu Asn Ala Asp Phe Ala Gin Ala Val lie Asp Ala Gly Leu lie Trp 
100 105 " 110 

ate ggc ccc age ccg cag teg ate cgc gac ctg ggc gac aag gtc acg 384 
lie Gly Pro Ser Pro Gin Ser lie Arg Asp Leu Gly Asp Lys Val Thr 
115 120 ' 125 

gcc cgt cac ate gcg gcc cgc get cag gcg ccc ctg gtg ccg ggt ace 4 32 

Ala Arg His He Ala Ala Arg Ala Gin Ala Pro Leu Val Pro Gly Thr 
130 135 140 

ccc gat ccg gtc aaa ggc gcc gac gag gtg gtg gca ttc gcc gag gag 480 
Pro Asp Pro Val Lys Gly Ala Asp Glu Val Val Ala Phe Ala Glu Glu 
145 150 155 160 

tac ggc ctg ccg ate gcg ate aag gcc gcc cac ggc ggc ggc ggc aag 528 
Tyr Gly Leu Pro He Ala He Lys Ala Ala His Gly Gly Gly Gly Lys 
165 170 " 175 

ggc atg aag gtg gcc cgc ace ate gac gag att ccg gag ctg tac gag 576 
Gly Met Lys Val Ala Arg Thr He Asp Glu He Pro Glu Leu Tyr Glu 
180 185 190 

teg gcg gtg cgc gag gcc acg gcc gcg ttc ggc cgc ggt gag tgc tac 624 
Ser Ala Val Arg Glu Ala Thr Ala Ala Phe Gly Arg Gly Glu Cys Tyr 
195 200 205 

gtg gag cgc tat etc gac aag ccg cgc cac gtc gaa gca cag gtg ate 672 
Val Glu Arg Tyr Leu Asp Lys Pro Arg His Val Glu Ala Gin Val He 
210 215 220 

gcc gac cag cac ggc aac gtc gtc gtc gcc ggc ace egg gac tgc teg 720 
Ala Asp Gin His Gly Asn Val Val Val Ala Gly Thr Arg Asp Cys Ser 
225 230 235 ' 240 

ctg cag cgc cgc tac cag aag ctg gtc gag gag gcg ccc gca ccg ttc 768 
Leu Gin Arg Arg Tyr Gin Lys Leu Val Glu Glu Ala Pro Ala Pro Phe 
245 250 255 

ctg ace gac ttt caa cgc aaa gag ate cac gac teg gcc aaa egg att 816 
Leu Thr Asp Phe Gin Arg Lys Glu He His Asp Ser Ala Lys Arg He 
260 265 270 

tgc aaa gag gcc cat tac cac ggc gcc ggc ace gtc gaa tac ctg gtc 864 
Cys Lys Glu Ala His Tyr His Gly Ala Gly Thr Val Glu Tyr Leu Val 
275 280 285 

ggt cag gac ggc ttg ate teg ttc ttg gag gtc aac acg cgc ctt cag 912 
Gly Gin Asp Gly Leu He Ser Phe Leu Glu Val Asn Thr Arg Leu Gin 
290 295 300 

gta gaa cac ccg gtc ace gag gaa acc gcg ggc ate gac ttg gtg ctg 960 
Val Glu His Pro Val Thr Glu Glu Thr Ala Gly He Asp Leu Val Leu 
305 310 315 320 
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cag 
Gin 


caa 
Gin 


ttc 
Phe 


egg 
Arg 


ate 
He 
325 


gec 
Ala 


aac 
Asn 


ggc 
Gly 


gaa 
Glu 


aag 
Lys 
330 


ctg 
Leu 


gac 
Asp 


ate 
He 


acc 
Thr 


gag 
Glu 
335 


gat 
Asp 


1008 


ccc 
Pro 


acc 
Thr 


ccg 
Pro 


cgc 
Arg 
340 


ggg 

Gly 


cac 
His 


gee 
Ala 


ate 
He 


gaa 
Glu 
345 


ttc 
Phe 


egg 
Arg 


ate 
He 


aac 
Asn 


ggc 
Gly 
350 


gag 
Glu 


gac 
Asp 


1056 


gcg 
Ala 


ggg 

Gly 


cgt 
Arg 
355 


aac 
Asn 


ttc 
Phe 


eta 
Leu 


ccg 
Pro 


gcg 
Ala 
360 


ccc 
Pro 


ggg 

Gly 


ccg 
Pro 


gtg 
Val 


aca 
Thr 
365 


aag 
Lys 


ttc 
Phe 


cac 
His 


1104 


ccg 
Pro 


ccg 
Pro 
370 


tec 
Ser 


ggc 
Gly 


ccc 
Pro 


ggt 
Gly 


gtg 

Val 
375 


egg 
Arg 


gtg 
Val 


gac 
Asp 


tec 
Ser 


ggt 
Gly 
380 


gtc 
Val 


gag 
Glu 


acc 
Thr 


ggc 
Gly 


1152 


teg 
Ser 
385 


gtg 
Val 


ate 
He 


ggc 
Gly 


ggc 
Gly 


cag 
Gin 
390 


ttc 
Phe 


gac 
Asp 


teg 
Ser 


atg 
Met 


ctg 
Leu 
395 


gec 
Ala 


aag 
Lys 


ctg 
Leu 


ate 
lie 


gtg 
Val 
400 


1200 


cac 
His 


ggt 
Gly 


gec 
Ala 


gac 
Asp 


cgc 
Arg 
405 


gec 
Ala 


gag 
Glu 


gcg. 
Ala 


ctg 
Leu 


gcg 
Ala 
410 


egg 
Arg 


gec 
Ala 


egg 
Arg 


cgc 
Arg 


gcg 
Ala 
415 


ctg 
Leu 


1248 


aac 
Asn 


gag 
Glu 


ttc 
Phe 


ggt 
Gly 
420 


gtc 
Val 


gaa 
Glu 


ggc 
Gly 


ctg 
Leu 


gcg 
Ala 
425 


acg 
Thr 


gtc 
Val 


ate 
He 

\ 


ccg 
Pro 


ttt 
Phe 
430 


cac 
His 


cgc 
Arg 


1296 


gec 
Ala 


gtg 
Val 


gtg 
Val 
435 


tec 
Ser 


gac 
Asp 


ccg 
Pro 


gca 
Ala 


ttc 
Phe 
440 


ate 
He 


ggc 
Gly 


gac 
Asp 


gcg 
Ala 


aac 
Asn 
445 


ggc 
Gly 


ttt 
Phe 


teg 
Ser 


1344 


gta 
Val 


cat 
His 
450 


acc 
Thr 


cgc 
Arg 


tgg 
Trp 


ate 
He 


gag 
Glu 
455 


acc 
Thr 


gag 
Glu 


tgg 
Trp 


aat 
Asn 


aac 
Asn 
460 


acc 
Thr 


ate 
He 


gag 
Glu 


ccc 
Pro 


1392 


ttt 
Phe 
465 


acc 
Thr 


gac 
Asp 


ggc 
Gly 


gaa 
Glu 


cct 
Pro 
470 


etc 
Leu 


gac 
Asp 


gag 
Glu 


gac 
Asp 


gec 
Ala 
475 


egg 
Arg 


ccg 
Pro 


cgt 
Arg 


cag 
Gin 


aag 
Lys 
480 


1440 


gtg 
Val 


gtc 
Val 


gtc 
Val 


gaa 
Glu 


ate 
He 
485 


gac 
Asp 


ggt 
Gly 


cgc 
Arg 


cgc 
Arg 


gtc 
Val 
490 


gaa 
Glu 


gtc 
Val 


teg 
Ser 


ctg 
Leu 


ccg 
Pro 
495 


get 
Ala 


1488 


gat 
Asp 


etc 
Leu 


gcg 
Ala 


ctg 
Leu 
500 


tec 
Ser 


aat 
Asn 


ggc 
Gly 


ggc 
Gly 


ggt 
Gly 
505 


tgc 
Cys 


gac 
Asp 


ccg 
Pro 


gtc 
Val 


ggt 
Gly 
510 


gtc 
Val 


ate 
He 


1536 


egg cgc 
Arg Arg 


aag 
Lys 
515 


ccc 
Pro 


aag 
Lys 


ccg 
Pro 


cgc 
Arg 


aag 
Lys 
520 


egg 
Arg 


ggt 
Gly 


gcg 
Ala 


cac 
His 


acc 
Thr 
525 


ggc 
Gly 


gcg 
Ala 


gcg 
Ala 


1584 


gee 
Ala 


tec 
Ser 
530 


ggt 
Gly 


gac 
Asp 


gcg 
Ala 


gtg 
Val 


acc 
Thr 
535 


gcg 
Ala 


cct 
Pro 


atg 
Met 


cag 
Gin 


ggc 
Gly 
540 


acc 
Thr 


gta 
Val 


gtt 
Val 


aag 
Lys 


1632 


ttc 
Phe 
545 


gcg 
Ala 


gtc 
Val 


gaa 
Glu 


gaa 
Glu 


ggg 

Gly 
550 


caa 
Gin 


gag 

GlU 


gtc 
Val 


gtg 
vai 


gee 
Ala 
555 


ggc 
ijiy 


gac 
Asp 


eta 
Leu 


gtg 
v ai 


gtg 
vai 
560 


1680 


gtc 


etc 


gag 


gcg 


atg 


aag 


atg 


gaa 


aac 


ccg 


gtc 


acc 


gcg 


cat 


aag 


gat 


1728 
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V 



Val Leu Glu Ala Met Lys Met Glu Asn Pro Val Thr Ala His Lys Asp 
565 570 575 

ggc acc ate acc ggg ctg gcg gtc gag gcg ggc gcg gec ate ace cag 1776 
Gly Thr He Thr Gly Leu Ala Val Glu Ala Gly Ala Ala He Thr Gin 
580 585 590 

ggc acg gtg etc gee gag ate aag taa 1803 
Gly Thr Val Leu Ala Glu He Lys 
595 600 



<210> 26 
<211> 600 
<212> PRT 

<213> M. Tuberculosis 
<400> 26 

Val Ala Ser His Ala Gly Ser Arg He Ala Arg He Ser Lys Val Leu 

1 5 10 15 

Val Ala Asn Arg Gly Glu He Ala Val Arg Val He Arg Ala Ala Arg 

20 25 30 

Asp Ala Gly Leu Pro Ser Val Ala Val Tyr Ala Glu Pro Asp Ala Glu 

35 40 45 

Ser Pro His Val Arg Leu Ala Asp Glu Ala Phe Ala Leu Gly Gly Gin 

50 55 60 

Thr Ser Ala Glu Ser Tyr Leu Asp Phe Ala Lys He Leu Asp Ala Ala 
65 70 75 80 

Ala Lys Ser Gly Ala Asn Ala He His Pro Gly Tyr Gly Phe Leu Ala 

85 90 " 95 

Glu Asn Ala Asp Phe Ala Gin Ala Val He Asp Ala Gly Leu He Trp 

100 105 no 

He Gly Pro Ser Pro Gin Ser lie Arg Asp Leu Gly Asp Lys Val Thr 

115 120 125 

Ala Arg His He Ala Ala Arg Ala Gin Ala Pro Leu Val Pro Gly Thr 

130 135 140 

Pro Asp Pro Val Lys Gly Ala Asp Glu Val Val Ala Phe Ala Glu Glu 
145 150 155 160 

Tyr Gly Leu Pro lie Ala He Lys Ala Ala His Gly Gly Gly Gly Lys 

165 170 ' 175 

Gly Met Lys Val Ala Arg Thr lie Asp Glu lie Pro Glu Leu Tyr Glu 

180 185 190 

Ser Ala Val Arg Glu Ala Thr Ala Ala Phe Gly Arg Gly Glu Cys Tyr 

195 200 * 205 

Val Glu Arg Tyr Leu Asp Lys Pro Arg His Val Glu Ala Gin Val lie 

210 215 220 

Ala Asp Gin His Gly Asn Val Val Val Ala Gly Thr Arg Asp Cys Ser 
225 230 235 240 

Leu Gin Arg Arg Tyr Gin Lys Leu Val Glu Glu Ala Pro Ala Pro Phe 

245 250 255 

Leu Thr Asp Phe Gin Arg Lys Glu lie His Asp Ser Ala Lys Arg He 

260 265 * 270 

Cys Lys Glu Ala His Tyr His Gly Ala Gly Thr Val Glu Tyr Leu Val 

275 280 285 

Gly Gin Asp Gly Leu He Ser Phe Leu Glu Val Asn Thr Arg Leu Gin 

290 295 300 

Val Glu His Pro Val Thr Glu Glu Thr Ala Gly lie Asp Leu Val Leu 
305 310 315 * 320 

Gin Gin Phe Arg lie Ala Asn Gly Glu Lys Leu Asp He Thr Glu Asp 

325 330 " 335 

Pro Thr Pro Arg Gly His Ala He Glu Phe Arg He Asn Gly Glu Asp 
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340 345 350 



Ala 


Gly 


Arg 


Asn 


Phe 


Leu 


Pro 


Ala 


Pro 


Gly 


Pro 


Val 


Thr 


Lys 


Phe 


His 




355 










360 










365 








Pro 


Pro 


Ser 


Gly 


Pro 


Gly 


Val 


Arg 


Val 


Asp 


Ser 


Gly 


Val 


Glu 


Thr 


Gly 




370 










375 










. 380 










Ser 


Val 


He 


Gly 


Gly 


Gin 


Phe 


Asp 


Ser 


Met 


Leu 


Ala 


Lys 


Leu 


He 


Val 


385 










390 










395 










400 


His 


Gly 


Ala 


Asp 


Arg 


Ala 


Glu 


Ala 


Leu 


Ala 


Arg 


Ala 


Arg 


Arg 


Ala 


Leu 










405 










410 










415 




Asn 


Glu 


Phe 


Gly 


Val 


Glu 


Gly 


Leu 


Ala 


Thr 


Val 


He 


Pro 


Phe 


His 


Arg 








420 










425 










430 






Ala 


Val 


Val 


Ser 


Asp 


Pro 


Ala 


Phe 


He 


Gly 


Asp 


Ala 


Asn 


Gly 


Phe 


Ser 






435 










440 










445 








Val 


His 


Thr 


Arg 


Trp 


He 


Glu 


Thr 


Glu 


Trp 


Asn 


Asn 


Thr 


He 


Glu 


Pro 




450 










455 










4 60 










Phe 


Thr 


Asp 


Gly 


Glu 


Pro 


Leu 


Asp 


Glu 


Asp 


Ala 


Arg 


Pro 


Arg 


Gin 


Lys 


465 










470 










475 










480 


Val 


Val 


Val 


Glu 


He 


Asp 


Gly 


Arg 


Arg 


Val 


Glu 


Val 


Ser 


Leu 


Pro 


Ala 










485 










490 










495 




Asp 


Leu 


Ala 


Leu 


Ser 


Asn 


Gly 


Gly 


Gly 


Cys 


Asp 


Pro 


Val 


Gly 


Val 


He 






500 










505 










510 






Arg 


Arg 


Lys 


Pro 


Lys 


Pro 


Arg 


Lys 


Arg 


Gly 


Ala 


His 


Thr 


Gly 


Ala 


Ala 






515 










520 










525 








Ala 


Ser 


Gly 


Asp 


Ala 


Val 


Thr 


Ala 


Pro 


Met 


Gin 


Gly 


Thr 


Val 


Val 


Lys 




530 










535 










540 










Phe 


Ala 


Val 


Glu 


Glu 


Gly 


Gin 


Glu 


Val 


Val 


Ala 


Gly 


Asp 


Leu 


Val 


Val 


545 










550 










555 










560 


Val 


Leu 


Glu 


Ala 


Met 


Lys 


Met 


Glu 


Asn 


Pro 


Val 


Thr 


Ala 


His 


Lys 


Asp 










565 










570 










575 




Gly 


Thr 


He 


Thr 


Gly 


Leu 


Ala 


Val 


Glu 


Ala 


Gly 


Ala 


Ala 


He 


Thr 


Gin 






580 










585 










590 






Gly 


Thr 


Val 


Leu 


Ala 


Glu 


He 


Lys 




















595 










600 



















<210> 27 
<211> 318 
<212> DNA 

<213> M. Tuberculosis 



<220> 

<221> CDS 

<222> (1) . . . (315) 

<400> 27 

atg cca gtg gtg aag ate aac gca ate gag gtg ccc gec ggc get ggc 48 

Met Pro Val Val Lys He Asn Ala He Glu Val Pro Ala Gly Ala Gly 
1 5 10 15 

ccc gag ctg gag aag egg. ttc get cac cgc gcg cac gcg gtc gag aac 96 
Pro Glu Leu Glu Lys Arg Phe Ala His Arg Ala His Ala Val Glu Asn 
20 25 30 

tec ccg ggt ttc etc ggc ttt cag ctg tta cgt ccg gtc aag ggt gaa 14 4 

Ser Pro Gly Phe Leu Gly Phe Gin Leu Leu Arg Pro Val Lys Gly Glu 
35 40 45 

gaa cgc tac ttc gtg gtg aca cac tgg gag tec gat gaa gca ttc cag 192 
• Glu Arg Tyr Phe Val Val Thr His Trp Glu Ser Asp Glu Ala Phe Gin 
50 55 60 



gcg tgg gca aac ggg ccc gec ate gca gec cat gec gga cac egg gee 



240 
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26 

Ala Trp Ala Asn Gly Pro Ala He Ala Ala His Ala Gly His Arg Ala 
65 70 75 80 

aac ccc gtg gcg acc ggt get teg ctg ctg gaa ttc gag gtc gtg ctt 288 
Asn Pro Val Ala Thr Gly Ala Ser Leu Leu Glu Phe Glu Val Val Leu 
85 90 95 

gac gtc ggt ggg acc ggc aag act gca taa 318 
Asp Val Gly Gly Thr Gly Lys Thr Ala 
100 105 



<210> 28 
<211> 105 
<212> PRT 

<213> M. Tuberculosis 
<400> 28 



Met 


Pro 


Val 


Val 


Lys 


He Asn 


Ala 


He 


Glu 


Val 


Pro 


Ala 


Gly Ala Gly 


1 








5 








10 










15 


Pro 


Glu 


Leu 


Glu 


Lys 


Arg Phe 


Ala 


His Arg Ala 


His 


Ala 


Val 


Glu Asn 








20 








25 










30 




Ser 


Pro 


Gly 


Phe 


Leu 


Gly Phe 


Gin 


Leu 


Leu 


Arg 


Pro 


Val 


Lys 


Gly Glu 






35 








40 










45 


Glu Arg 


Tyr 


Phe 


Val 


Val Thr 


His 


Trp 


Glu 


Ser 


Asp 


Glu 


Ala 


Phe Gin 




50 








55 










60 








Ala 


Trp 


Ala 


Asn 


Gly 


Pro Ala 


He 


Ala 


Ala 


His 


Ala 


Gly 


His 


Arg Ala 


65 










70 








75 








80 


Asn 


Pro 


Val 


Ala 


Thr 
85 


Gly Ala 


Ser 


Leu 


Leu 
90 


Glu 


Phe 


Glu 


Val 


Val Leu 
95 


Asp 


Val 


Gly Gly 


Thr 


Gly Lys 


Thr 


Ala 















100 105 



<210> 29 
<211> 435 
<212> DNA 

<213> M. Tuberculosis 

<220> 

<221> CDS 

<222> (1) . . . (435) 

<400> 29 

gtg gcg gac aag acg aca cag acg att tac ate gac gcg gat cca ggc 48 
Val Ala Asp Lys Thr Thr Gin Thr He Tyr He Asp Ala Asp Pro Gly 
1 5 10 15 

gag gtg atg aag gcg ate gec gac ate gaa gee tac ccg caa tgg att 96 
Glu Val Met Lys Ala He Ala Asp He Glu Ala Tyr Pro Gin Trp He 
20 25 30 

teg gag tat aag gaa gtc gag ate eta gag gec gac gac gag ggc tac 144 
Ser Glu Tyr Lys Glu Val Glu He Leu Glu Ala Asp Asp Glu Gly Tyr 
35 40 45 

ccg aaa cga gcg cga atg ttg atg gac gca gee ate ttc aaa gac acc 192 
Pro Lys Arg Ala Arg Met Leu Met Asp Ala Ala He Phe Lys Asp Thr 
50 55 60 

ttg ate atg tec tac gag tgg ccg gaa gac cgc caa teg ctt age tgg 240 
Leu He Met Ser Tyr Glu Trp Pro Glu Asp Arg Gin Ser Leu Ser Trp 
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65 70 75 80 

act etc gaa tec age teg ctg eta aag tec etc gaa ggc acg tat cgc 288 
Thr Leu Glu Ser Ser Ser Leu Leu Lys Ser Leu Glu Gly Thr Tyr Arg 
85 90 95 

ttg gcg ccc aag ggt tct ggc act gag gtc acc tac gag ctt gee gtc 336 
Leu Ala Pro Lys Gly Ser Gly Thr Glu Val Thr Tyr Glu Leu Ala Val 
100 105 110 

gac ctt get gtc ccc atg ate ggg atg etc aag cgt aag gcg gaa cgc 384 
Asp Leu Ala Val Pro Met lie Gly Met Leu Lys Arg Lys Ala Glu Arg 
115 120 125 

agg ttg ata gac ggc gcg ttg aag gat ctg aag aaa cga gtc gag ggc 432 
Arg Leu lie Asp Gly Ala Leu Lys Asp Leu Lys Lys Arg Val Glu Gly 
130 135 140 

tga 435 



<210> 30 
<211> 144 
<212> PRT 

<213> M. Tuberculosis 



<400> 30 



Met 


Ala 


Asp 


Lys 


Thr 


Thr 


Gin 


Thr 


He 


Tyr 


He 


Asp 


Ala 


Asp 


Pro Gly 


1 








5 










10 










15 


Glu 


Val 


Met 


Lys 


Ala 


He 


Ala 


Asp 


lie 


Glu 


Ala 


Tyr 


Pro 


Gin 


Trp lie 








20 










25 










30 




Ser 


Glu 


Tyr 


Lys 


Glu 


Val 


Glu 


He 


Leu 


Glu 


Ala 


Asp Asp Glu 


Gly Tyr 






35 










40 










45 






Pro 


Lys 


Arg 


Ala 


Arg 


Met 


Leu 


Met 


Asp 


Ala 


Ala 


He 


Phe 


Lys 


Asp Thr 




50 










55 










60 








Leu 


He 


Met 


Ser 


Tyr 


Glu 


Trp 


Pro 


Glu 


Asp 


Arg 


Gin 


Ser 


Leu 


Ser Trp 


65 










70 










75 








80 


Thr 


Leu 


Glu 


Ser 


Ser 


Ser 


Leu 


Leu 


Lys 


Ser 


Leu 


Glu 


Gly 


Thr 


Tyr Arg 










85 










90 










95 


Leu 


Ala 


Pro 


Lys 


Gly 


Ser 


Gly Thr 


Glu 


Val 


Thr 


Tyr 


Glu 


Leu 


Ala Val 








100 










105 










110 




Asp 


Leu 


Ala 


Val 


Pro 


Met 


He 


Gly Met 


Leu 


Lys 


Arg 


Lys 


Ala 


Glu Arg 






115 










120 










125 






Arg 


Leu 


He 


Asp 


Gly 


Ala 


Leu 


Lys 


Asp 


Leu 


Lys 


Lys 


Arg 


Val 


Glu Gly 



130 135 140 



<210> 31 

<211> 441 

<212> DNA 

<213> M. Tuberculosis 

<220> 

<221> CDS 

<222> (1) . . . (438) 

<400> 31 

atg cca gtt ttg age aag acc gtc gag gtc acc gee gac gee gca teg 48 

Met Pro Val Leu Ser Lys Thr Val Glu Val Thr Ala Asp Ala Ala Ser 
15 10 15 
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ate atg gec ate gtt gee gat ate gag cgc tac cca gag tgg aat gaa 96 
He Met Ala He Val Ala Asp He Glu Arg Tyr Pro Glu Trp Asn Glu 
20 25 30 

ggg gtc aag ggc gca tgg gtg etc get cgc tac gat gac ggg cgt ccc 14 4 

Gly Val Lys Gly Ala Trp Val Leu Ala Arg Tyr Asp Asp Gly Arg Pro 
35 40 45 

age cag gtg egg etc gac acc get gtt caa ggc ate gag ggc ace tat 192 
Ser Gin Val Arg Leu Asp Thr Ala Val Gin Gly He Glu Gly Thr Tyr 
50 55 60 

ate cac gee gtg tac tac cca ggc gaa aac cag att caa acc gtc atg 240 
He His Ala Val Tyr Tyr Pro Gly Glu Asn Gin He Gin Thr Val Met 
65 70 75 80 

cag cag ggt gaa ctg ttt gee aag cag gag cag ctg ttc agt gtg gtg 288 
Gin Gin Gly Glu Leu Phe Ala Lys Gin Glu Gin Leu Phe Ser Val Val 
85 90 95 

gca acc ggc gee gcg age ttg etc acg gtg gac atg gac gtc cag gtc 336 
Ala Thr Gly Ala Ala Ser Leu Leu Thr Val Asp Met Asp Val Gin Val 
100 105 no 

acc atg ccg gtg ccc gag ccg atg gtg aag atg ctg etc aac aac gtc 384 
Thr Met Pro Val Pro Glu Pro Met Val Lys Met Leu Leu Asn Asn Val 
115 120 125 

ctg gag cat etc gee gaa aat etc aag cag cgc gee gag cag ctg gcg 432 
Leu Glu His Leu Ala Glu Asn Leu Lys Gin Arg Ala Glu Gin Leu Ala 
130 135 140 

gee age taa 441 

Ala Ser 

145 



<210> 32 
<211> 146 
<212> PRT 

<213> M. Tuberculosis 



<400> 32 



Met 


Pro 


Val 


Leu 


Ser 


Lys 


Thr 


Val 


Glu 


Val Thr Ala 


Asp Ala Ala 


Ser 


1 








5 










10 


15 




He 


Met 


Ala 


He 


Val 


Ala 


Asp He Glu Arg Tyr Pro Glu Trp Asn Glu 








20 










25 




30 




Gly Val 


Lys 


Gly 


Ala 


Trp 


Val 


Leu 


Ala 


Arg Tyr Asp Asp Gly Arg 


Pro 






35 










40 






45 




Ser 


Gin 


Val 


Arg 


Leu 


Asp 


Thr 


Ala 


Val 


Gin Gly He 


Glu Gly Thr 


Tyr 




50 










55 






60 


He 


His 


Ala 


Val 


Tyr 


Tyr 


Pro Gly Glu Asn Gin He 


Gin Thr Val 


Met 


65 










70 








75 




80 


Gin 


Gin 


Gly Glu 


Leu 


Phe 


Ala 


Lys 


Gin 


Glu Gin Leu 


Phe Ser Val 


Val 










85 










90 


95 




Ala 


Thr 


Gly Ala 


Ala 


Ser 


Leu 


Leu 


Thr 


Val Asp Met 


Asp Val Gin 


Val 








100 










105 




110 




Thr 


Met 


Pro 


Val 


Pro 


Glu 


Pro 


Met 


Val 


Lys Met Leu 


Leu Asn Asn 


Val 






115 










120 






125 




Leu 


Glu 


His 


Leu 


Ala 


Glu 


Asn 


Leu 


Lys 


Gin Arg Ala 


Glu Gin Leu 


Ala 
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130 135 140 

Ala Ser 
145 

<210> 33 
<211> 894 
<212> DNA 

<213> M. Tuberculosis 

<220> 

<221> CDS 

<222> (1) . . . (891) 

<400> 33 

atg tea teg ggc aat tea tct ctg gga att ate gte ggg ate gac gat 

Met Ser Ser Gly Asn Ser Ser Leu Gly lie lie Val Gly lie Asp Asp 
1 5 10 15 



48 



tea ccg gcc gca cag gtt gcg gtg egg tgg gca get egg gat gcg gag 96 
Ser Pro Ala Ala Gin Val Ala Val Arg Trp Ala Ala Arg Asp Ala Glu 
20 25 30 

ttg cga aaa ate cct ctg acg etc gtg cac gcg gtg teg ccg gaa gta 144 
Leu Arg Lys lie Pro Leu Thr Leu Val His Ala Val Ser Pro Glu Val 
35 40 45 

gcc ace tgg ctg gag gtg cca ctg ccg ccg ggc gtg ctg cga tgg cag 192 
Ala Thr Trp Leu Glu Val Pro Leu Pro Pro Gly Val Leu Arg Trp Gin 
50 55 60 

cag gat cac ggg cgc cac ctg ate gac gac gca etc aag gtg gtt gaa 240 
Gin Asp His Gly Arg His Leu lie Asp Asp Ala Leu Lys Val Val Glu 
65 70 75 80 

cag get teg ctg cgc get ggt ccc ccc acg gte cac agt gaa ate gtt 288 
Gin Ala Ser Leu Arg Ala Gly Pro Pro Thr Val His Ser Glu lie Val 
85 90 95 

ccg gcg gca gcc gtt ccc aca ttg gte gac atg tec aaa gac gca gtg 336 
Pro Ala Ala Ala Val Pro Thr Leu Val Asp Met Ser Lys Asp Ala Val 
100 105 110 

ctg atg gte gtg ggt tgt etc gga agt ggg egg tgg ccg ggc egg ctg 384 
Leu Met Val Val Gly Cys Leu Gly Ser Gly Arg Trp Pro Gly Arg Leu 
115 " " 120 125 

etc ggt teg gte agt tec ggc ctg etc cgc cac gcg cac tgt ccg gte 432 
Leu Gly Ser Val Ser Ser Gly Leu Leu Arg His Ala His Cys Pro Val 
130 135 140 

gtg ate ate cac gac gaa gat teg gtg atg ccg cat ccc cag eaa gcg 480 
Val lie lie His Asp Glu Asp Ser Val Met Pro His Pro Gin Gin Ala 
145 150 155 160 

ccg gtg eta gtt ggc gtt gac ggc teg teg gcc tec gag ctg gcg ace 528 
Pro Val Leu Val Gly Val Asp Gly Ser Ser Ala Ser Glu Leu Ala Thr 
165 170 175 

gca ate gca ttc gac gaa gcg teg egg cga aac gtg gac ctg gtg gcg 57 6 

Ala He Ala Phe Asp Glu Ala Ser Arg Arg Asn Val Asp Leu Val Ala 
180 185 190 



WO 00/21983 



30 



PCI7DK99/00538 



ctg cac gca tgg age gac gtc gat gtg teg gag tgg ccc gga ate gat 624 
Leu His Ala Trp Ser Asp Val Asp Val Ser Glu Trp Pro Gly lie Asp 
195 200 205 

tgg ccg gca act cag teg atg gee gag cag gtg ctg gee gag egg ttg 672 
Trp Pro Ala Thr Gin Ser Met Ala Glu Gin Val Leu Ala Glu Arg Leu 
210 215 220 

gcg ggt tgg cag gag egg tat ccc aac gta gee ata acc cgc gtg gtg 720 
Ala Gly Trp Gin Glu Arg Tyr Pro Asn Val Ala He Thr Arg Val Val 
225 230 235 240 

gtg cgc gat cag ccg gec cgc cag etc gtc caa cgc tec gag gaa gee 768 
Val Arg Asp Gin Pro Ala Arg Gin Leu Val Gin Arg Ser Glu Glu Ala 
245 250 ' 255 

cag ctg gtc gtg gtc ggc age egg ggc cgc ggc ggc tac gee gga atg 816 
Gin Leu Val Val Val Gly Ser Arg Gly Arg Gly Gly Tyr Ala Gly Met 
260 265 ' 270 

ctg gtg ggg teg gta ggc gaa acc gtt get cag ctg gcg egg acg ccg 8 64 

Leu Val Gly Ser Val Gly Glu Thr Val Ala Gin Leu Ala Arg Thr Pro 
275 280 285 

gtc ate gtg gca cgc gag teg ctg act tag 894 
Val He Val Ala Arg Glu Ser Leu Thr 
290 295 



<210> 34 
<211> 297 
<212> PRT 

<213> M. Tuberculosis 



<400> 34 



Met Ser 


Ser Gly 


Asn 


Ser 


Ser 


Leu 


Gly He He 


Val 


Gly 


lie Asp Asp 


1 






5 








10 






15 


Ser Pro 


Ala 


Ala 


Gin 


Val 


Ala 


Val 


Arg Trp Ala Ala Arg Asp Ala Glu 






20 










25 






30 


Leu Arg 


Lys 


He 


Pro 


Leu 


Thr 


Leu 


Val His Ala 


Val 


Ser 


Pro Glu Val 




35 










40 






45 




Ala Thr 


Trp 


Leu 


Glu 


Val 


Pro 


Leu 


Pro Pro Gly Val 


Leu Arg Trp Gin 


50 










55 






60 






Gin Asp 


His 


Gly 


Arg 


His 


Leu 


He Asp Asp Ala Leu Lys Val Val Glu 


65 








70 






75 






80 


Gin Ala 


Ser 


Leu 


Arg 


Ala 


Gly 


Pro 


Pro Thr Val 


His 


Ser 


Glu lie Val 








85 








90 






95 


Pro Ala 


Ala 


Ala 


Val 


Pro 


Thr 


Leu 


Val Asp Met 


Ser 


Lys 


Asp Ala Val 






100 










105 






110 


Leu Met 


Val 


Val 


Gly 


Cys 


Leu 


Gly 


Ser Gly Arg 


Trp 


Pro 


Gly Arg Leu 




115 










120 






125 




Leu Gly 


Ser 


Val 


Ser 


Ser 


Gly Leu 


Leu Arg His 


Ala 


His 


Cys Pro Val 


130 










135 






140 




Val He 


He 


His 


Asp 


Glu 


Asp 


Ser 


Val Met Pro 


His 


Pro 


Gin Gin Ala 


145 








150 






155 






160 


Pro Val 


Leu 


Val 


Gly 


Val 


Asp Gly Ser Ser Ala 


Ser 


Glu 


Leu Ala Thr 








165 








170 






175 


Ala He 


Ala 


Phe 


Asp 


Glu 


Ala 


Ser 


Arg Arg Asn 


Val 


Asp 


Leu Val Ala 






180 










185 






190 


Leu His 


Ala 


Trp 


Ser 


Asp 


Val 


Asp 


Val Ser Glu 


Trp 


Pro Gly lie Asp 
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195 



Trp 


Pro 


Ala 


Thr 


Gin 


Ser 




210 










Ala 


Gly 


Trp 


Gin 


Glu 


Arg 


225 










230 


Val 


Arg 


Asp 


Gin 


Pro 


Ala 










245 




Gin 


Leu 


Val 


Val 


Val 


Gly 








260 






Leu 


Val 


Gly 


Ser 


Val 


Gly 






275 








Val 


He 


Val 


Ala 


Arg 


Glu 



290 



200 



Met 


Ala 


Glu 


Gin 


Val 


Leu 


215 










220 


Tyr 


Pro 


Asn 


Val 


Ala 
235 


He 


Arg 


Gin 


Leu 


Val 
250 


Gin 


Arg 


Ser 


Arg 


Gly 
265 


Arg 


Gly 


Gly 


Glu 


Thr 
280 


Val 


Ala 


Gin 


Leu 


Ser 


Leu 


Thr 









295 
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205 



Ala 


Glu 


Arg 


Leu 


Thr 


Arg 


Val 


Val 








240 


Ser 


Glu 


Glu 


Ala 






255 




Tyr 


Ala 


Gly 


Met 




270 






Ala 


Arg 


Thr 


Pro 


285 









<210> 35 
<211> 957 
<212> DNA 

<213> M. Tuberculosis 



<220> 

<221> CDS 

<222> (1) . . . (954) 



<400> 35 

atg get gaa gta ctg gtg etc gtt gag cac get gaa ggc gcg tta aag 48 
Met Ala Glu Val Leu Val Leu Val Glu His Ala Glu Gly Ala Leu Lys 
15 10 15 

aag gtc age gee gaa ttg ate acc gee gee cgc gee ttg ggc gaa cca 96 
Lys Val Ser Ala Glu Leu He Thr Ala Ala Arg Ala Leu Gly Glu Pro 
20 25 30 

gee gec gtc gtc gtc ggt gtg ccg ggg acg gec gcg ccg ctg gtg gac 144 
Ala Ala Val Val Val Gly Val Pro Gly Thr Ala Ala Pro Leu Val Asp. 
35 40 45 

ggg ctt aag gcg get ggt gec gec aag ate tac gtc gec gag tec gac 192 
Gly Leu Lys Ala Ala Gly Ala Ala Lys He Tyr Val Ala Glu Ser Asp 
50 55 60 



ctt gtc gac aaa tac ctg ate acc ccg gcg gtc gac gtg ctg gee ggg 240 
Leu Val Asp Lys Tyr Leu He Thr Pro Ala Val Asp Val Leu Ala Gly 
65 ^ 70 75 80 



ctg gee gag tec teg gec cct gec ggc gta eta ate gec gec acc gcg 288 
Leu Ala Glu Ser Ser Ala Pro Ala Gly Val Leu He Ala Ala Thr Ala 
85 ~ 90 95 



gac ggc aag gag ate gee ggc cga ctt gcg get egg ate ggc teg ggt 336 

Asp Gly Lys Glu He Ala Gly Arg Leu Ala Ala Arg He Gly Ser Gly 

100 105 110 

ctg ctg gtc gac gtg gtc gac gtg aga gaa ggt gga gtg ggt gtc cac 384 

Leu Leu Val Asp Val Val Asp Val Arg Glu Gly Gly Val Gly Val His 

115 120 125 

age ate ttc ggt ggg gcg ttc acc gtc gaa gcg cag gee aac ggc gac 4 32 

Ser He Phe Gly Gly Ala Phe Thr Val Glu Ala Gin Ala Asn Gly Asp 

.130 " " 135 140 



acc ccg gtg ate acc gtg cgc gca gga gec gtg gag gcg gag ccg gec 



480 
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Thr Pro Val He Thr Val Arg Ala Gly Ala Val Glu Ala Glu Pro Ala 
145 150 155 160 

gcc ggc gcc ggt gag cag gtc age gtg gaa gtg ccg get gcg gcg gag 528 
Ala Gly Ala Gly Glu Gin Val Ser Val Glu Val Pro Ala Ala Ala Glu 
165 170 175 

aac gcc gcc agg ate acc gcg cgc gaa ccg gcg gtc gcc ggc gac egg 576 
Asn Ala Ala Arg He Thr Ala Arg Glu Pro Ala Val Ala Gly Asp Arg 
180 185 190 

ccg gag ctg acc gag gcg acc att gtg gtg gcc ggt ggc cgt ggt gtc 624 
Pro Glu Leu Thr Glu Ala Thr He Val Val Ala Gly Gly Arg Gly Val 
195 200 205 

ggc age gcg gag aac ttc age gtg gtc gag gcg ctg gcc gac teg ctg 672 
Gly Ser Ala Glu Asn Phe Ser Val Val Glu Ala Leu Ala Asp Ser Leu 
210 215 220 

ggc gcc gcg gtc ggg gcc teg cgt gcc gca gtc gac tec ggc tac tac 720 
Gly Ala Ala Val Gly Ala Ser Arg Ala Ala Val Asp Ser Gly Tyr Tyr 
225 230 235 240 

ccg ggc cag ttc cag gtc ggc cag acc ggc aag acg gtg teg ccc cag 768 
Pro Gly Gin Phe Gin Val Gly Gin Thr Gly Lys Thr Val Ser Pro Gin 
245 250 255 

etc tac att gcc ctg ggc ate tec ggg gcg ate cag cac cgc get ggc 816 
Leu Tyr He Ala Leu Gly He Ser Gly Ala He Gin His Arg Ala Gly 
260 265 270 

atg cag acg tec aag acc ate gtc gcg gtc aac aag gac gaa gag gcg 864 
Met Gin Thr Ser Lys Thr He Val Ala Val Asn Lys Asp Glu Glu Ala 
275 280 285 

ccg ate ttt gag ate gcc gac tac ggg gtg gtg gga gac ctg ttc aag 912 
Pro He Phe Glu He Ala Asp Tyr Gly Val Val Gly Asp Leu Phe Lys 
290 295 300 

gtc get ccg cag ctg acc gag gcc ate aag gcc cgc aag ggc 954 

Val Ala Pro Gin Leu Thr Glu Ala He Lys Ala Arg Lys Gly 
305 310 " 315 

tag 957 

<210> 36 
<211> 318 
<212> PRT 

<213> M. Tuberculosis 
<400> 36 

Met Ala Glu Val Leu Val Leu Val Glu His Ala Glu Gly Ala Leu Lys 

1 5 10 15 

Lys Val Ser Ala Glu Leu He Thr Ala Ala Arg Ala Leu Gly Glu Pro 

20 25 30 

Ala Ala Val Val Val Gly Val Pro Gly Thr Ala Ala Pro Leu Val Asp 

35 40 45 

Gly Leu Lys Ala Ala Gly Ala Ala Lys He Tyr Val Ala Glu Ser Asp 

50 55 60 

Leu Val Asp Lys Tyr Leu He Thr Pro Ala Val Asp Val Leu Ala Gly 
65 70 75 80 
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Leu 


Ala 


Glu 


Ser 


Ser 


Ala 


Pro Ala 


Gly Val 


Leu 


He 


Ala 


Ala 


Thr 


Ala 










85 






90 










95 




Asp Gly 


Lys 


Glu 


He 


Ala 


Gly Arg 


Leu Ala 


Ala 


Arg 


He 


Gly 


Ser 


Gly 






100 








105 








110 






Leu 


Leu 


Val 


Asp 


Val 


Val 


Asp Val 


Arg Glu Gly 


Gly 


Val 


Gly 


Val 


His 






115 






120 








125 








Ser 


He 


Phe 


Gly 


Gly 


Ala 


Phe Thr 


Val Glu 


Ala 


Gin 


Ala 


Asn 


Gly 


Asp 




130 








135 






140 










Thr 


Pro 


Val 


He 


Thr 


Val 


Arg Ala 


Gly Ala 


Val 


Glu 


Ala 


Glu 


Pro 


Ala 


145 










150 






155 










160 


Ala 


Gly 


Ala 


Gly 


Glu 


Gin 


Val Ser 


Val Glu 


Val 


Pro 


Ala 


Ala 


Ala 


Glu 






165 






170 










175 




Asn 


Ala 


Ala 


Arg 


He 


Thr 


Ala Arg 


Glu Pro 


Ala 


Val 


Ala 


Gly 


Asp 


Arg 








180 








185 








190 






Pro 


Glu 


Leu 


Thr 


Glu 


Ala 


Thr He 


Val Val 


Ala 


Gly 


Gly 


Arg 


Gly 


Val 






195 








200 








205 








Gly 


Ser 


Ala 


Glu 


Asn 


Phe 


Ser Val 


Val Glu 


Ala 


Leu 


Ala 


Asp 


Ser 


Leu 


210 










215 






220 










Gly Ala 


Ala 


Val 


Gly 


Ala 


Ser Arg 


Ala Ala 


Val 


Asp 


Ser 


Gly 


Tyr 


Tyr 


225 










230 






235 










240 


Pro 


Gly 


Gin 


Phe 


Gin 


Val 


Gly Gin 


Thr Gly Lys 


Thr 


Val 


Ser 


Pro 


Gin 








245 






250 










255 




Leu 


Tyr 


He 


Ala 


Leu 


Gly 


He Ser 


Gly Ala 


He 


Gin 


His 


Arg 


Ala 


Gly 






260 








265 








270 






Met 


Gin 


Thr 


Ser 


Lys 


Thr 


He Val 


Ala Val 


Asn 


Lys 


Asp 


Glu 


Glu 


Ala 






275 








280 








285 








Pro 


He 


Phe 


Glu 


He 


Ala 


Asp Tyr 


Gly Val 


Val 


Gly 


Asp 


Leu 


Phe 


Lys 




290 










295 






300 










Val 


Ala 


Pro 


Gin 


Leu 


Thr 


Glu Ala 


He Lys Ala 


Arg 


Lys 


Gly 






305 










310 






315 













<210> 37 
<211> 1401 
<212> DNA 

<213> M. Tuberculosis 



<220> 

<221> CDS 

<222> (1) . . . (1398) 

<400> 37 

gtg aag age acc gtc gag cag ttg age ccc acc egg gtt cgt ate aac 48 
Val Lys Ser Thr Val Glu Gin Leu Ser Pro Thr Arg Val Arg He Asn 
15 10 15 

gtg gag gtg cca ttc gee gag ctt gag ccg gat ttc cag egg gec tac 96 
Val Glu Val Pro Phe Ala Glu Leu Glu Pro Asp Phe Gin Arg Ala Tyr 
20 25 30 

aaa gag ctg gee aaa cag gtg egg ctg ccc ggc ttc egg ccc ggg aag 144 
Lys Glu Leu Ala Lys Gin Val Arg Leu Pro Gly Phe Arg Pro Gly Lys 
35 40 45 

gcg ccg gee aaa eta etc gaa gee cgc ate ggc egg gag gee atg ctg 192 
Ala Pro Ala Lys Leu Leu Glu Ala Arg He Gly Arg Glu Ala Met Leu 
50 55 60 

gat caa ate gtc aac gat gcg ctg ccc age egg tac gga cag gcg gtg 240 
Asp Gin He Val Asn Asp Ala Leu Pro Ser Arg Tyr Gly Gin Ala Val 
65 70 75 80 
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gcc gag teg gat gtc caa ccg etc ggc egg ccc aac ate gag gtg ace 288 
Ala Glu Ser Asp Val Gin Pro Leu Gly Arg Pro Asn lie Glu Val Thr 
85 90 95 

aag aag gag tac ggc cag gac ctg caa ttc acc gcc gag gtc gac ate 336 
Lys Lys Glu Tyr Gly Gin Asp Leu Gin Phe Thr Ala Glu Val Asp He 
100 105 no 

cgc ccg aag ate agt ccc ccg gac ctg age gcg ctg acg gtc teg gtg 384 
Arg Pro Lys He Ser Pro Pro Asp Leu Ser Ala Leu Thr Val Ser Val 
115 120 125 

gat ccg ate gaa ate ggt gag gac gac gtc gac gcc gaa ctg cag teg 432 
Asp Pro He Glu He Gly Glu Asp Asp Val Asp Ala Glu Leu Gin Ser 
130 135 140 

tta cgt acc egg ttc ggc acc ctg acc gcg gtg gac egg ccg gtg gcc 480 
Leu Arg Thr Arg Phe Gly Thr Leu Thr Ala Val Asp Arg Pro Val Ala 
145 150 155 160 

gtc ggc gac gtc gtc teg ate gac ttg tct gcc acg gtc gac gga gag 528 
Val Gly Asp Val Val Ser He Asp Leu Ser Ala Thr Val Asp Gly Glu 
165 170 175 

gac ata ccg aac gca gcc get gag gga etc tec cac gag gtc ggc tec 576 
Asp He Pro Asn Ala Ala Ala Glu Gly Leu Ser His Glu Val Gly Ser 
180 185 190 

ggc egg etc ate gca ggt etc gac gac gcg gtt gtt ggt ctg tec gcc 624 
Gly Arg Leu He Ala Gly Leu Asp Asp Ala Val Val Gly Leu Ser Ala 
195 200 205 

gac gag tec egg gtc ttc acc gcc aag ctg gca gcc ggc gag cac gcc 672 
Asp Glu Ser Arg Val Phe Thr Ala Lys Leu Ala Ala Gly Glu His Ala 
210 215 220 

ggg cag gaa get cag gtt acc gtc acg gtc agg teg gtt aag gag cgc 720 
Gly Gin Glu Ala Gin Val Thr Val Thr Val Arg Ser Val Lys Glu Arg 
225 230 235 240 

gaa eta cca gag ccc gac gac gaa ttc gcg cag tta gcc age gag ttc 768 
Glu Leu Pro Glu Pro Asp Asp Glu Phe Ala Gin Leu Ala Ser Glu Phe 
245 250 255 

gac age ate gac gaa ttg egg gcc age etc age gac cag gtg cgc cag 816 
Asp Ser lie Asp Glu Leu Arg Ala Ser Leu Ser Asp Gin Val Arg Gin 
260 265 270 

gcc aag cgc gcc cag cag gcc gag cag att cga aac gcc acc ate gat 864 
Ala Lys Arg Ala Gin Gin Ala Glu Gin He Arg Asn Ala Thr lie Asp 
275 280 285 

gcg eta etc gaa cag gtc gac gtg ccg ttg ccg gag teg tat gtg cag 912 
Ala Leu Leu Glu Gin Val Asp Val Pro Leu Pro Glu Ser Tyr Val Gin 
290 295 300 

gcc caa ttc gac age gtg ctg cac age gcg etc age ggt ctt aat cac 960 
Ala Gin Phe Asp Ser Val Leu His Ser Ala Leu Ser Gly Leu Asn His 
305 310 315 320 



gac gaa gcc egg ttc aat gag ttg etc gtc gag caa ggc teg tea cgc 1008 
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Asp Glu Ala Arg Phe Asn Glu Leu Leu Val Glu Gin Gly Ser Ser Arg 
325 330 335 

gcg gcg ttc gat gcc gag gcg cgc acc gcc tea gaa aag .gac gtc aag 
Ala Ala Phe Asp Ala Glu Ala Arg Thr Ala Ser Glu Lys Asp Val Lys 
34 0 34 5 350 



1056 



agg cag ctg ttg eta gac gcc ctg gcc gat gag ctg cag gtc caa gtt 1104 
Arg Gin Leu Leu Leu Asp Ala Leu Ala Asp Glu Leu Gin Val Gin Val 
355 360 365 

ggc cag gat gat ctg acc gaa cga ctg gtg acg acg tct egg caa tac 1152 
Gly Gin Asp Asp Leu Thr Glu Arg Leu Val Thr Thr Ser Arg Gin Tyr 
370 375 380 

ggc ate gag ccg cag cag ctg ttc ggc tac etc caa gag cgc aac cag 1200 
Gly lie Glu Pro Gin Gin Leu Phe Gly Tyr Leu Gin Glu Arg Asn Gin 
385 390 395 400 

ctg ccg acc atg ttc get gac gtg egg cgc gag ctg gcg ate agg gcc 1248 
Leu Pro Thr Met Phe Ala Asp Val Arg Arg Glu Leu Ala lie Arg Ala 
405 410 415 

gca gtg gag gcg gcg acg gtc acc gac agt gac gga aac acg ate gat 1296 
Ala Val Glu Ala Ala Thr Val Thr Asp Ser Asp Gly Asn Thr lie Asp 
420 425 430 

acc agt gag ttc ttc ggc aag cgt gtg teg gcc ggt gag get gag gag 1344 
Thr Ser Glu Phe Phe Gly Lys Arg Val Ser Ala Gly Glu Ala Glu Glu 
435 440 445 

gcc gaa ccg gca gac gag ggt gcc gcg egg gcg gcg tec gac gaa gcg 1392 
Ala Glu Pro Ala Asp Glu Gly Ala Ala Arg Ala Ala Ser Asp Glu Ala 
450 455 460 

aca.acg tga 1401 

Thr Thr 

465 



<210> 38 
<2.11> 466 
<212> PRT 

<213> M. Tuberculosis 



<400> 38 



Met 


Lys 


Ser 


Thr 


Val 


Glu 


Gin 


Leu Ser Pro Thr Arg Val Arg He Asn 


1 






5 








10 


15 


Val 


Glu 


Val 


Pro 


Phe 


Ala 


Glu 


Leu 


Glu Pro Asp Phe 


Gin Arg Ala Tyr 








20 










25 


30 


Lys 


Glu 


Leu 


Ala 


Lys 


Gin 


Val 


Arg 


Leu Pro Gly Phe 


Arg Pro Gly Lys 






35 










40 




45 


Ala 


Pro 


Ala 


Lys 


Leu 


Leu 


Glu 


Ala 


Arg He Gly Arg 


Glu Ala Met Leu 




50 










55 




60 




Asp Gin 


He 


Val 


Asn 


Asp 


Ala 


Leu 


Pro Ser Arg Tyr 


Gly Gin Ala Val 


65 










70 






75 


80 


Ala 


Glu 


Ser 


Asp 


Val 


Gin 


Pro 


Leu 


Gly Arg Pro Asn 


He Glu Val Thr 










85 








90 


95 


Lys 


Lys 


Glu 


Tyr 


Gly 


Gin 


Asp 


Leu 


Gin Phe Thr Ala 


Glu Val Asp He 








100 










105 


110 


Arg 


Pro 


Lys 


He 


Ser 


Pro 


Pro Asp 


Leu Ser Ala Leu 


Thr Val Ser Val 
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115 120 125 

Asp Pro He Glu He Gly Glu Asp Asp Val Asp Ala Glu Leu Gin Ser 

130 135 140 

Leu Arg Thr Arg Phe Gly Thr Leu Thr Ala Val Asp Arg Pro Val Ala 
145 150 155 160 

Val Gly Asp Val Val Ser He Asp Leu Ser Ala Thr Val Asp Gly Glu 

165 170 175 

Asp He Pro Asn Ala Ala Ala Glu Gly Leu Ser His Glu Val Gly Ser 

180 185 190 

Gly Arg Leu He Ala Gly Leu Asp Asp Ala Val Val Gly Leu Ser Ala 

195 200 205 

Asp Glu Ser Arg Val Phe Thr Ala Lys Leu Ala Ala Gly Glu His Ala 

210 215 220 

Gly Gin Glu Ala Gin Val Thr Val Thr Val Arg Ser Val Lys Glu Arg 
225 230 235 ~ 240 

Glu Leu Pro Glu Pro Asp Asp Glu Phe Ala Gin Leu Ala Ser Glu Phe 

245 250 255 

Asp Ser He Asp Glu Leu Arg Ala Ser Leu Ser Asp Gin Val Arg Gin 

260 265 270 

Ala Lys Arg Ala Gin Gin Ala Glu Gin He Arg Asn Ala Thr He Asp 

275 280 285 

Ala Leu Leu Glu Gin Val Asp Val Pro Leu Pro Glu Ser Tyr Val Gin 

290 295 300 

Ala Gin Phe Asp Ser Val Leu His Ser Ala Leu Ser Gly Leu Asn His 
305 310 315 ^ 320 

Asp Glu Ala Arg Phe Asn Glu Leu Leu Val Glu Gin Gly Ser Ser Arg 

325 330 335 

Ala Ala Phe Asp Ala Glu Ala Arg Thr Ala Ser Glu Lys Asp Val Lys 

340 345 350 

Arg Gin Leu Leu Leu Asp Ala Leu Ala Asp Glu Leu Gin Val Gin Val 

355 360 * 365 

Gly Gin Asp Asp Leu Thr Glu Arg Leu Val Thr Thr Ser Arg Gin Tyr 

370 375 380 

Gly He Glu Pro Gin Gin Leu Phe Gly Tyr Leu Gin Glu Arg Asn Gin 
385 390 395 " 400 

Leu Pro Thr Met Phe Ala Asp Val Arg Arg Glu Leu Ala He Arg Ala 

405 410 415 

Ala Val Glu Ala Ala Thr Val Thr Asp Ser Asp Gly Asn Thr He Asp 

420 425 " 430 

Thr Ser Glu Phe Phe Gly Lys Arg Val Ser Ala Gly Glu Ala Glu Glu 

435 440 ~ 445 

Ala Glu Pro Ala Asp Glu Gly Ala Ala Arg Ala Ala Ser Asp Glu Ala 

450 455 460 

Thr Thr 
465 

<210> 39 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 39 

Thr Glu Arg Thr Ala Val Leu lie Lys Pro Asp Gly He Glu Arg 
1 5 10 15 

<210> 40 

<211> 15 

<212> PRT 

<213> M. Tuberculosis 



<400> 40 
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Thr Asp Thr Gin Val Thr Trp Leu Thr Gin Glu Ser His Asp Arg 
15 10 15 

<210> 41 
<211> 15 
<212> PRT 

<213> M. Tuberculosis \ 
<400> 41 

Met lie Asp Glu Ala Leu Phe Asp Ala Glu Glu Lys Met Glu Lys 
1 5 10 15 

<210> 42 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 42 

Pro Leu Pro Ala Asp Pro Ser Thr Asp Leu Ser Ala Tyr Ala Gin 
15 10 15 

<210> 43 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 43 

Met Leu lie Ser Gin Arg Pro Thr Leu Ser Glu Asp Val Leu Thr 
15 10 15 

* <210> 44 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 44 

Thr Gly Asn Leu Val Thr Lys Asn Ser Leu Thr Pro Asp Val Arg 
1 5 10 15 

<210> 45 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 45 

Met Glu Val Lys lie Gly lie Thr Asp Ser Pro Arg Glu Leu Val 
1 5 10 15 

<210> 46 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 46 

Ser Ala Tyr Lys Thr Val Val Val Gly Thr Asp Asp Xaa Ser Xaa 
1 5 10 15 

<210> 47 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
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<400> 47 

Met Glu Gin Arg Ala Glu Leu Val Val Gly Arg Ala Leu Val Val 
1 5 10 15 

<210> 48 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 4 8 

Ala Asp lie Asp Gly Val Thr Gly Ser Ala Gly Leu Asn Pro Ala 
15 10 15 

<210> 49 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 49 

Thr Tyr Glu Thr lie Leu Val Glu Arg Asp Gin Arg Val Gly lie 
1 5 10 15 

<210> 50 
<211> 15 
<212> PRT 

<213> M . Tuberculosis 
<400> 50 

Pro Val Thr Gin Glu Glu He He Ala Gly He Ala Glu He He 
15 10 15 

<210> 51 
<211> 14 
<212> PRT 

<213> M. Tuberculosis 
<400> 51 

Pro Val Val Lys He Asn Ala He Glu Val Pro Ala Gly Ala 
15 10 

<210> 52 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 52 

Ala Asp Lys Thr Thr Gin Thr He Tyr He Asp Ala Asp Pro Gly 
15 10 15 

<210> 53 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 53 

Pro Val Leu Ser Lys Thr Val Glu Val Thr Ala Asp Ala Ala Ser 
1 5 10 15 

<210> 54 
<211> 14 
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<212> PRT 

<213> M. Tuberculosis 
<400> 54 

Ser Gly Asn Ser Ser Leu Gly He He Val Gly lie ASp Asp 
15 10 



<210> 55 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 55 

Ala Glu Val' Leu Val Leu Val Glu His Ala Glu Gly Ala Leu Lys 
15 10 15 



<210> 56 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 



<400> 56 

Met Lys Ser Thr Val Glu Gin Leu Ser Pro Thr Arg Val Arg He 
1 5 10 15 

<210> 57 
<211> 11 
<212> PRT 

<213> M. Tuberculosis 
<400> 57 

Val He Arg Arg Lys Pro Lys Pro Arg Xaa Arg 
15 10 



<210> 58 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 
<400> 58 

ctgagatctg tggaggtcaa gatcggt 27 

<210> 59 
<211> 31 
<212> DNA 

<213> M. Tuberculosis 
<400> 59 

ctcccatggc tacttacccg ctcgtagcaa c 31 

<210> 60 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 
<400> 60 

ctgagatctc ctgtcactca ggaagaa 27 



<210> 61 
<211> 27 
<212> DNA 
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<213> M. Tuberculosis 

<400> 61 
ctcccatggg aaaccgccat tagcggt 

<210> 62 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 

<400> 62 
cccaagctta tggaacagcg tgcggag 

<210> 63 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 

<400> 63 
ctcccatggc gacactcgat ccggatt 

<210> 64 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 

<400> 64 
ctgagatcta tgccagtggt gaagatc 

<210> 65 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 

<400> 65 
ctcccatggt tatgcagtct tgccggt 

<210> 66 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 

<400> 66 
ctgagatctg cggacaagac gacacag 

<210> 67 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 
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40 



27 



27 



27 



27 



27 



27 



<400> 67 
ctcccatggt accggaatca ctcagcc 

<210> 68 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 



27 



<400> 68 
ctgagatctc cagttttgag caagacc 



27 



WO 00/21983 



41 
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<210> 69 
<211> 27 
<212> DNA 

<213> M. Tuberculosis ' 

<400> 69 
ctcccatggg cacatgcctt agctggc 

<210> 70 

<211> 27 

<212> DNA 

<213> M. Tuberculosis 

<400> 70 
ctgagatcta tgtcatcggg caattca 



<210> 71 
<211> 31 
<212> DNA 

<213> M. Tuberculosis 
<400> 71 

ctcccatggc tacctaagtc agcgactcgc g 



31 



<210> 72 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 

<400> 72 
ctgagatctg tgaagagcac cgtcgag 

<210> 73 

<211> 27 

<212> DNA 

<213> M. Tuberculosis 

<400> 73 
ctcccatggg tcatacggtc acgttgt 



<210> 74 
<211> 348 
<212> DNA 

<213> M. Tuberculosis 

<220> 

<221> CDS 

<222> (1)...(348) 

<400> 74 

atg gca etc aag gta gag atg gtc act ttc gac tgc age gac cct gcg 48 
Met Ala Leu Lys Val Glu Met Val Thr Phe Asp Cys Ser Asp Pro Ala 
1 5 10 15 

aag ctt gec ggc tgg tgg gec gag cag ttc gat ggc acg acg cgt gaa 96 
Lys Leu Ala Gly Trp Trp Ala Glu Gin Phe Asp Gly Thr Thr Arg Glu 
20 25 30 

ctg ctg ccc ggc gaa ttc gtc gtg gtc gec egg acc gat gga ccg egg 144 
Leu Leu Pro Gly Glu Phe Val Val Val Ala Arg Thr Asp Gly Pro Arg 
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35 40 45 

ttg gga ttc cag aag gtg ccc gat ccc gcc cct ggg aaa aac cgc gtg 192 
Leu Gly Phe Gin Lys Val Pro Asp Pro Ala Pro Gly Lys Asn Arg Val 
50 55 60 

cac etc gac ttc acg acc aag gac ctg gat gcc gag gtg ttg cgc ctg 240 
His Leu Asp Phe Thr Thr Lys Asp Leu Asp Ala Glu Val Leu Arg Leu 
65 70 75 80 

gtc gcc gcc gga gcc agt gag gtc ggg egg cat cag gtc ggc gag age 288 
Val Ala Ala Gly Ala Ser Glu Val Gly Arg His Gin Val Gly Glu Ser 
85 90 95 

ttt cgc tgg gtg gtg ctg get gac ccc gaa ggc aac get ttt tgc gtg 336 
Phe Arg Trp Val Val Leu Ala Asp Pro Glu Gly Asn Ala Phe Cys Val 
100 105 110 

gcg ggt caa taa 34 8 

Ala Gly Gin * 
115 



<210> 75 
<211> 115 
<212> PRT 

<213> M. Tuberculosis 
<400> 75 



Met 


Ala 


Leu 


Lys 


Val 


Glu 


Met 


Val 


Thr Phe 


Asp Cys 


Ser Asp Pro 


Ala 


1 








5 








10 






15 




Lys 


Leu 


Ala 


Gly 


Trp 


Trp 


Ala 


Glu 


Gin Phe 


Asp Gly 


Thr 


Thr Arg 


Glu 








20 










25 






30 




Leu 


Leu 


Pro 


Gly 


Glu 


Phe 


Val 


Val 


Val Ala 


Arg Thr 


Asp 


Gly Pro Arg 






35 










40 






45 






Leu 


Gly 


Phe 


Gin 


Lys 


Val 


Pro 


Asp 


Pro Ala 


Pro Gly 


Lys 


Asn Arg 


Val 




50 










55 






60 








His 


Leu 


Asp 


Phe 


Thr 


Thr 


Lys 


Asp 


Leu Asp 


Ala Glu 


Val 


Leu Arg 


Leu 


65 










70 








75 






80 


Val 


Ala 


Ala 


Gly 


Ala 


Ser 


Glu 


Val 


Gly Arg 


His Gin 


Val 


Gly Glu 


Ser 










85 








90 






95 




Phe 


Arg 


Trp 


Val 


Val 


Leu 


Ala 


Asp 


Pro Glu 


Gly Asn Ala 


Phe Cys 


Val 



100 105 110 



Ala Gly Gin 
115 

<210> 76 
<211> 564 
<212> DNA 

<213> M. Tuberculosis 

<220> 

<221> CDS 

<222> (1) . . . (564) 

<400> 76 

atg gcc gac get gac acc acc gac ttc gac gtc gac gca gaa gca ccg 48 

Met Ala Asp Ala Asp Thr Thr Asp Phe Asp Val Asp Ala Glu Ala Pro 
15 10 15 



ggt gga ggc gtc egg gag gac acg gcg acg gat get gac gag gcc gac 



96 
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Gly Gly Gly Val Arg Glu Asp Thr Ala Thr Asp Ala Asp Glu Ala Asp 
20 25 30 

gat caa gaa gag aga ttg gtc gcc gag ggc gag att gca ggc gac tac 14 4 

Asp Gin Glu Glu Arg Leu Val Ala Glu Gly Glu lie Ala Gly Asp Tyr 
35 40 45 

ctg gaa gag tta ttg gac gtg ttg gac ttc gat ggc gac ate gac etc 192 
Leu Glu Glu Leu Leu Asp Val Leu Asp Phe Asp Gly Asp lie Asp Leu 
50 * 55 60 

gat gtc gaa ggc aat cgt gcg gtg gtg age ate gac ggc agt gac gac 240 
Asp Val Glu Gly Asn Arg Ala Val Val Ser lie Asp Gly Ser Asp Asp 
65 70 75 80 

ctg aac aag ttg gtc ggg cgc ggg ggc gag gtg etc gac get ctg cag 288 
Leu Asn Lys Leu Val Gly Arg Gly Gly Glu Val Leu Asp Ala Leu Gin 
85 ^ 90 95 

gaa etc acc egg ttg gcg gtg cat cag aag ace ggt gtg egg age egg 336 
Glu Leu Thr Arg Leu Ala Val His Gin Lys Thr Gly Val Arg Ser Arg 
100 105 110 

ttg atg eta gac ate gcg agg tgg cga egg egg cgc egg gag gaa ttg 384 
Leu Met Leu Asp lie Ala Arg Trp Arg Arg Arg Arg Arg Glu Glu Leu 
115 * 120 125 

gcg gcg ctg gcc gac gag gtg gcg egg cga gtg gcc gaa acc ggt gac 432 
Ala Ala Leu Ala Asp Glu Val Ala Arg Arg Val Ala Glu Thr Gly Asp 
130 135 140 

cgc gag gaa etc gtt cca atg acg ccg ttc gaa egg aag ate gtc cac 480 
Arg Glu Glu Leu Val Pro Met Thr Pro Phe Glu Arg Lys lie Val His 
145 150 155 160 

gat gcg gtt gca gcg gtg cca ggt gtg cac age gaa age gaa ggc gtg 528 
Asp Ala Val Ala Ala Val Pro Gly Val His Ser Glu Ser Glu Gly Val 
165 170 175 



gag cca gaa cgc cga gtc gtt gtg etc cgc gac tag - 564 
Glu Pro Glu Arg Arg Val Val Val Leu Arg Asp 
180 185 



<210> 


77 








<211> 


187 








<212> 


PRT 








<213> 


M. Tuberculosis 


<400> 


77 








Met Ala Asp 


Ala 


Asp 


Thr 


Thr Asp 


1 




5 






Gly Gly Gly 


Val 


Arg 


Glu 


Asp Thr 




20 








Asp Gin Glu 


Glu 


Arg 


Leu 


Val Ala 


35 








40 


Leu Glu Glu 


Leu 


Leu 


Asp 


Val Leu 


50 








55 


Asp Val Glu 


Gly 


Asn 


Arg 


Ala Val 


65 






70 




Leu Asn Lys 


Leu 


Val 


Gly 


Arg Gly 



Phe 


Asp 


Val 


Asp 


Ala 


Glu 


Ala 


Pro 




10 










15 




Ala 


Thr 


Asp 


Ala 


Asp 


Glu 


Ala 


Asp 


25 










30 






Glu 


Gly 


Glu 


He 


Ala 


Gly. 


Asp 


Tyr 










45 








Asp 


Phe 


Asp 


Gly 


Asp 


He 


Asp 


Leu 








60 










Val 


Ser 


He 


Asp 


Gly 


Ser 


Asp 


Asp 






75 










80 


Gly 


Glu 


Val 


Leu 


Asp 


Ala 


Leu 


Gin 
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85 






90 






95 


Glu 


Leu 


Thr Arg 


Leu 


Ala 


Val 


His Gin Lys Thr Gly Val 


Arg 


Ser Arg 








100 








105 




110 




Leu 


Met 


Leu 
115 


Asp 


He 


Ala 


Arg 


Trp Arg Arg Arg Arg 
120 


Arg 
125 


Glu 


Glu Leu 


Ala 


Ala 
130 


Leu 


Ala 


Asp 


Glu 


Val 
135 


Ala Arg Arg Val Ala 
140 


Glu 


Thr 


Gly Asp 


Arg 


Glu 


Glu 


Leu 


Val 


Pro 


Met 


Thr Pro Phe Glu Arg 


Lys 


He 


Val His 


145 










150 




155 






160 


Asp Ala 


Val 


Ala 


Ala 


Val 


Pro Gly Val His Ser Glu 


Ser 


Glu Gly Val 










165 






170 






175 


Glu 


Pro 


Glu 


Arg 
180 


Arg 


Val 


Val 


Val Leu Arg Asp 
185 









<210> 78 

<211> 1167 

<212> DNA 

<213> M. Tuberculosis 

<220> 

<221> CDS 

<222> (1) . . . (1167) 

<400> 78 

atg age aag acg gtt etc ate ctt ggc gcg ggt gtc ggc ggc ctg acc 4 8 

Met Ser Lys Thr Val Leu He Leu Gly Ala Gly Val Gly Gly Leu Thr 
15 10 15 

acc gee gac acc etc cgt caa ctg eta cca cct gag gat cga ate ata 96 
Thr Ala Asp Thr Leu Arg Gin Leu Leu Pro Pro Glu Asp Arg He He 
20 25 ^ 30 

ttg gtg gac agg age ttt gac ggg acg ctg ggc ttg teg ttg eta tgg 144 
Leu Val Asp Arg Ser Phe Asp Gly Thr Leu Gly Leu Ser Leu Leu Trp 
35 40 45 

gtg ttg egg ggc tgg egg egg cct gac gac gtc cgc gtc cgc ccc acc 192 
Val Leu Arg Gly Trp Arg Arg Pro Asp Asp Val Arg Val Arg Pro Thr 
50 55 60 

gcg gcg teg ctg ccc ggt gtg gaa atg gtt act gca acc gtc gee cac 240 
Ala Ala Ser Leu Pro Gly Val Glu Met Val Thr Ala Thr Val Ala His 
65 70 75 80 

att gac ate gcg gee cag gta gtg cac acc gac aac age gtc ate ggc 288 
He Asp lie Ala Ala Gin Val Val His Thr Asp Asn Ser Val He Gly 
85 90 95 



tat gac gcg ttg gtg ate 
Tyr Asp Ala Leu Val He 
100 

gtt ccc gga ctg teg gac 
Val Pro Gly Leu Ser Asp 
115 

tac acc ctg gac ggc gcg 
Tyr Thr Leu Asp Gly Ala 
130 



gca tta ggt gcg gcg ctg 
Ala Leu Gly Ala Ala Leu 
105 

gcg etc gac gee gac gtc 
Ala Leu Asp Ala Asp Val 
120 

get gag ctg cgt gcg aag 
Ala Glu Leu Arg Ala Lys 
135 140 



aac acc gac gee 336 
Asn Thr Asp Ala 
110 

gcg ggc cag ttc 384 

Ala Gly Gin Phe 

125 

gtc gag gcg etc 432 
Val Glu Ala Leu 



gag cat ggc egg ate get gtg get ate gee ggg gtg ccg ttc aaa tgc 



480 
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Glu His Gly Arg He Ala Val Ala He Ala Gly Val Pro Phe Lys Cys 

145 1 150 155 160 



cca gcc gca ccg ttc gaa gcg gcg ttt ctg ate gec gec caa etc ggt 
Pro Ala Ala Pro Phe Glu Ala Ala Phe Leu He Ala Ala Gin Leu Gly 
165 170 175 



528 



gac cgc tac gcc acc gga acc gta cag ate gac acg ttc acg cct gac 576 
Asp Arg Tyr Ala Thr Gly Thr Val Gin He Asp Thr Phe Thr Pro Asp 
180 185 190 

ccg ctg ccg atg ccc gtt gca ggt ccc gag gtc ggc gag get ttg gtc 624 
Pro Leu Pro Met Pro Val Ala Gly Pro Glu Val Gly Glu Ala Leu Val 
195 • 200 205 

teg atg etc aag gat cac ggt gtc ggc ttc cat cct cgc aag gcc eta 672 
Ser Met Leu Lys Asp His Gly Val Gly Phe His Pro Arg Lys Ala Leu 
210, 215 220 

get cgc gtc gat gag gcc gca agg acg atg cac ttc ggt gac ggc acg 720 
Ala Arg Val Asp Glu Ala Ala Arg Thr Met His Phe Gly Asp Gly Thr 
225 230 235 240 

tec gaa ccg ttc gat ctg ctt gcc gtg gtc ccc ccg cac gtg ccc tec 768 
Ser Glu Pro Phe Asp Leu Leu Ala Val Val Pro Pro His Val Pro Ser 
245 250 255 

gcc gcg gcg egg tea gcg ggt etc age gaa tec ggg tgg ata ccc gtg 816 
Ala Ala Ala Arg Ser Ala Gly Leu Ser Glu Ser Gly Trp He Pro Val 
260 265 270 

gac ccg cgc acc ctg tec act age gcc gac aac gtg tgg gcc ate ggc 864 
Asp Pro Arg Thr Leu Ser Thr Ser Ala Asp Asn Val Trp Ala lie Gly 
275 280 285 

gat gcg acc gtg ctg acg ctg. ccg aat ggc aaa ccg ctg ccc aag get 912 
Asp Ala Thr Val Leu Thr Leu Pro Asn Gly Lys Pro Leu Pro Lys Ala 
290 295 300 

gcc gtg ttc gcc gaa gcc cag gcc gca gtt gtc gcc cac ggc gtc gcc 960 
Ala Val Phe Ala Glu Ala Gin Ala Ala Val Val Ala His Gly Val Ala 
305 310 315 320 

cgc cat etc ggt tac gac gta get gag cgc cac ttc acc ggc acg ggc 1008 
Arg His Leu Gly Tyr Asp Val Ala Glu Arg His Phe Thr Gly Thr Gly 
325 330 335 

gcc tgc tac gtc gag acc ggt gat cac cag gca gcc aag ggc gac ggc 1056 
Ala Cys Tyr Val Glu Thr Gly Asp His Gin Ala Ala Lys Gly Asp Gly 
340 345 350 

gat ttc ttc get ccg teg gcg ccc teg gtg acg ctg tac ccg ccg teg 1104 
Asp Phe Phe Ala Pro Ser Ala Pro Ser Val Thr Leu Tyr Pro Pro Ser 
355 360 365 

egg gag ttt cac gag gag aag gtc gca caa gaa ctg gcc tgg ctg acc 1152 
Arg Glu Phe His Glu Glu Lys Val Ala Gin Glu Leu Ala Trp Leu Thr 
370 375 380 

cgc tgg aag acg tga 1167 
Arg Trp Lys Thr * 
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<210> 79 
<211> 388 
<212> PRT 

<213> M. Tuberculosis 





<400> 


79 












Met 


Ser 


Lys 


Thr 


Val 


Leu 


He 


Leu Gly 


1 








5 










Thr 


Ala 


Asp 


Thr 


Leu 


Arg 


Gin 


Leu 


Leu 








20 










25 


Leu 


Val 


Asp 


Arg 


Ser 


Phe 


Asp 


Gly 


Thr 






35 










40 




Val 


Leu 


Arg 


Gly 


Trp 


Arg 


Arg 


Pro 


Asp 




50 










55 






Ala 


Ala 


Ser 


Leu 


Pro 


Gly 


Val 


Glu 


Met 


65 










70 








lie Asp 


He 


Ala 


Ala 


Gin 


Val 


Val 


His 










85 










Tyr Asp Ala 


Leu 


Val 


He 


Ala 


Leu 


Gly 








100 










105 


Val 


Pro 


Gly 


Leu 


Ser 


Asp 


Ala 


Leu 


Asp 






115 










120 




Tyr 


Thr 


Leu 


Asp 


Gly 


Ala 


Ala 


Glu 


Leu 




130 










135 






Glu 


His 


Gly 


Arg 


He 


Ala 


Val 


Ala 


lie 


145 










150 








Pro 


Ala 


Ala 


Pro 


Phe 


Glu 


Ala 


Ala 


Phe 










165 










Asp Arg 


Tyr 


Ala 


Thr 


Gly 


Thr 


Val 


Gin 








180 










185 


Pro 


Leu 


Pro 


Met 


Pro 


Val 


Ala 


Gly 


Pro 






195 










200 




Ser 


Met 


Leu 


Lys 


Asp 


His 


Gly 


Val 


Gly 




210 










215 






Ala Arg 


Val 


Asp 


Glu 


Ala 


Ala 


Arg 


Thr 


225 










230 








Ser 


Glu 


Pro 


Phe 


Asp 


Leu 


Leu 


Ala 


Val 










245 










Ala 


Ala 


Ala 


Arg 


Ser 


Ala 


Gly 


Leu 


Ser 








260 










265 


Asp 


Pro 


Arg 


Thr 


Leu 


Ser 


Thr 


Ser 


Ala 






275 










280 




Asp Ala 


Thr 


Val 


Leu 


Thr 


Leu 


Pro 


Asn 




290 










295 






Ala 


Val 


Phe 


Ala 


Glu 


Ala 


Gin 


Ala 


Ala 


305 










310 








Arg 


His 


Leu 


Gly 


Tyr 


Asp 


Val 


Ala 


Glu 










325 










Ala 


Cys 


Tyr 


Val 


Glu 


Thr 


Gly 


Asp 


His 








340 










345 


Asp 


Phe 


Phe 


Ala 


Pro 


Ser 


Ala 


Pro 


Ser 






355 










360 




Arg 


Glu 


Phe 


His 


Glu 


Glu 


Lys 


Val 


Ala 




370 










375 






Arg Trp 


Lys 


Thr 
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Ala 


Gly 


Val 


Gly 


Gly 


Leu 


Thr 


10 










15 




Pro 


Pro 


Glu 


Asp 


Arg 


He 


He 










30 






Leu 


Gly 


Leu 


Ser 


Leu 


Leu 


Trp 








45 








Asp 


Val 


Arg 


Val 


Arg 


Pro 


Thr 






60 










Val 


Thr 


Ala 


Thr 


Val 


Ala 


His 




75 










80 


Thr 


Asp 


Asn 


Ser 


Val 


lie 


Gly 


90 










95 




Ala 


Ala 


Leu 


Asn 


Thr 


Asp 


Ala 










110 






Ala 


Asp 


Val 


Ala 


Gly 


Gin 


Phe 








125 








Arg 


Ala 


Lys 


Val 


Glu 


Ala 


Leu 






140 










Ala 


Gly 


Val 


Pro 


Phe 


Lys 


Cys 




155 










160 


Leu 


He 


Ala 


Ala 


Gin 


Leu 


Gly 


170 










175 




He 


Asp 


Thr 


Phe 


Thr 


Pro 


Asp 










190 






Glu 


Val 


Gly 


Glu 


Ala 


Leu 


Val 








205 








Phe 


His 


Pro 


Arg 


Lys 


Ala 


Leu 






220 










Met 


His 


Phe 


Gly 


Asp 


Gly 


Thr 




235 










240 


Val 


Pro 


Pro 


His 


Val 


Pro 


Ser 


250 










255 




Glu 


Ser 


Gly 


Trp 


He 


Pro 


Val 










270 






Asp 


Asn 


Val 


Trp 


Ala 


He 


Gly 








285 








Gly 


Lys 


Pro 


Leu 


Pro 


Lys 


Ala 






300 










Val 


Val 


Ala 


His 


Gly 


Val 


Ala 




315 










320 


Arg 


His 


Phe 


Thr 


Gly 


Thr 


Gly 


330 










335 




Gin 


Ala 


Ala 


Lys 


Gly 


Asp 


Gly 










350 






Val 


Thr 


Leu 


Tyr 


Pro 


Pro 


Ser 








365 








Gin 


Glu 


Leu 


Ala 


Trp 


Leu 


Thr 



380 



<210> 80 
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<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 80 

Ala Leu Lys Val Glu Met Val Thr Phe Asp Xaa Ser Asp Pro Ala 
1 5 10 15 

<210> 81 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 81 

Ala Asp Ala Asp Thr Thr Asp Phe Asp Val Asp Ala Glu Ala Pro 
15 10 15 

<210> 82 
<211> 15 
<212> PRT 

<213> M. Tuberculosis 
<400> 82 

Ser Lys Thr Val Leu He Leu Gly Ala Gly Val Gly Gly Leu Thr 
1 5 10 15 

<210> 83 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 
<400> 83 

ctgagatcta tggcactcaa ggtagag 27 

<210> 84 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 
<400> 84 

ctcccatggt tattgacccg ccacgca 27 

<210> 85 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 
<400> 85 

ctgagatcta tggccgacgc tgacacc 27 

<210> 86 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 
<400> 86 

ctcccatggc tagtcgcgga gcacaac 27 



<210> 87 
<211> 27 
<212> DNA 
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48 



9 



<213> M. Tuberculosis 



<400> 87 
ctgagatcta tgagcaagac ggttctc 



27 



<210> 88 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 
<400> 88 

ctcccatggt cacgtcttcc agcgggt 27 

<210> 89 
<211> 28 
<212> DNA 

<213> M. Tuberculosis 



<210> 90 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 
<400> 90 

ctgaagctta tgagcgccta taagacc 27 

<210> 91 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 
<400> 91 

ctgagatcta tgattgatga ggctctc 27 

<210> 92 
<211> 27 
<212> DNA 

<213> M. Tuberculosis 
<400> 92 

ctcccatgga gcggccgcta gacctcc 27 

<210> 93 
<211> 30 
<212> DNA 

<213> M. Tuberculosis 
<400> 93 

ggctgagact catggccgac atcgatggtg 30 

<210> 94 
<211> 31 
<212> DNA 

<213> M. Tuberculosis 



<400> 89 
ctgccatggc taggtggtgt gcacgatc 



28 



<400> 94 

cgtaccatgg tcatgacgac accccctcgt g 



31 
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<210> 95 
<211> 30 
<212> DNA 

<213> M. Tuberculosis 



<400> 95 

ggctgagact catggctgaa gtactggtgc 30 

<210> 96 
<211> 31 
<212> DNA 

<213> M. Tuberculosis 
<400> 96 

cgtaccatgg ctagccggcg accgccggtt c 31 

<210> 97 
<211> 20 
<212> DNA 

<213> M. Tuberculosis 
<400> 97 

gtgaccgaac ggactctggt 20 

<210> 98 
<211> 21 
<212> DNA 

<213> M. Tuberculosis 
<400> 98 

ctaggcgccg ggaaaccaga g 21 

<210> 99 
<211> 23 
<212> DNA 

<213> M. Tuberculosis 



<400> 99 
atgacggata ctcaagtcac ctg 

<210> 100 
<211> 20 
<212> DNA 

<213> M. Tuberculosis 

<400> 100 
ggagtggtac ggctcggcgc 

<210> 101 
<211> 20 
. <212> DNA 

<213> M. Tuberculosis 



<400> 101 
atgacgtacg aaaccatcct 

<210> 102 
<211> 21 
<212> DNA 

<213> M. Tuberculosis 
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50 



<400> 102 
tcatcggtgg gtgaactggg g 



21 



<210> 103 
<211> 23 
<212> DNA 

<213> M. Tuberculosis 
<400> 103 

atgccgcttc ccgcagaccc tag 23 

<210> 104 
<211> 21 
<212> DNA 

<213> M. Tuberculosis 



<210> 105 
<211> 22 
<212> DNA 

<213> M. Tuberculosis 
<400> 105 

atgctgatct cacagcgccc ca 22 

<210> 106 
<211> 22 
<212> DNA 

<213> M. Tuberculosis 



<210> 107 
<211> 20 
<212> DNA 

<213> M* Tuberculosis 
<400> 107 

atgaccggaa atttggtgac 20 

<210> 108 
<211> 21 
<212> DNA 

<213> M. Tuberculosis 



<400> 104 
tacgacgggt accactcctg g 



21 



<400> 106 
aagctgttcg gtttcggcgt ag 



22 



<400> 108 
tcagtagcgg tagtggtccg g 



21 
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